Induction of tumor immunity by variants of folate binding protein

ABSTRACT

The present invention is directed to variants of antigens comprising folate binding protein epitopes as a composition associated with providing immunity against a tumor in an individual. The variant is effective in inducing cytotoxic T-lymphocytes but preferably not to the extent that they become sensitive to silencing by elimination, such as by apoptosis, or by anergy, as in unresponsiveness.

The present application is a divisional application of U.S. application Ser. No. 10/094,097, filed Mar. 8, 2002 (now U.S. Pat. No. 7,547,759) which claims priority to U.S. Provisional Patent Application Ser. No. 60/274,676 filed Mar. 9, 2001, incorporated by reference herein in its entirety.

The government owns rights in the present invention pursuant to United States Army grant number DAMD 17-94-J-4313.

FIELD OF THE INVENTION

The present invention is directed to the fields of cancer and immunology. Specifically, the present invention is directed to compositions and methods for tumor vaccines directed to tumor antigens and is directed to specific epitopes on these antigens that are recognized by cytotoxic T-lymphocytes (CTL). More specifically, the present invention regards compositions and methods for variants of folate binding protein (FBP).

BACKGROUND OF THE INVENTION

Tumor reactive T-cells have been reported to mediate therapeutic responses against human cancers (Rosenberg et al., 1988). In certain instances, in human immunotherapy trials with tumor infiltrating lymphocytes (TIL) or tumor vaccines, these responses correlated either with in vitro cytotoxicity levels against autologous tumors (Aebersold et al., 1991) or with expression of certain HLA-A,B,C gene products (Marincola et al., 1992). Recent studies (Ioannides et al., 1992) have proposed that in addition to virally encoded and mutated oncogenes, overexpressed self-proteins may elicit some degree of tumor-reactive cytotoxic T-lymphocytes (CTLs) in patients with various malignancies (Ioannides et al., 1992; Ioannides et al., 1993; Brichard et al., 1993; Jerome et al., 1991). Autologous tumor reactive CTLs can be generated from lymphocytes infiltrating ovarian malignant ascites (Ioannides et al., 1991), and overexpressed proteins, such as HER-2, may be targets for CTL recognition (Ioannides et al., 1992).

T-cells play an important role in tumor regression in most murine tumor models. Tumor infiltrating lymphocytes (TIL) that recognize unique cancer antigens can be isolated from many murine tumors. The adoptive transfer of these TIL in addition to interleukin-2 can mediate the regression of established lung and liver metastases (Rosenberg et al., 1986). In addition, the secretion of IFN-γ by injected TIL significantly correlates with in vivo regression of murine tumors suggesting activation of T-cells by the tumor antigens (Barth et al., 1991). The known ability of TIL to mediate the regression of metastatic cancer in 35 to 40% of melanoma patients when adoptively transferred into patients with metastatic melanoma attests to the clinical importance of the antigens recognized (Rosenberg et al., 1988; Rosenberg, 1992).

Strong evidence that an immune response to cancer exists in humans is provided by the existence of tumor reactive lymphocytes within melanoma deposits. These lymphocytes, when isolated, are capable of recognizing specific tumor antigens on autologous and allogeneic melanomas in an MHC restricted fashion. (Itoh et al., 1986; Muul et al., 1987; Topalian et al., 1989; Darrow et al., 1989; Hom et al., 1991; Kawakami et al., 1992; Hom et al., 1993; O'Neil et al., 1993). TIL from patients with metastatic melanoma recognize shared antigens including melanocyte-melanoma lineage specific tissue antigens in vitro (Kawakami et al., 1993; Anichini et al. 1993). Anti-melanoma T-cells appear to be enriched in TIL, probably as a consequence of clonal expansion and accumulation at the tumor site in vivo (Sensi et al., 1993). The transduction of T-cells with a variety of genes, such as cytokines, has been demonstrated. T-cells have been shown to express foreign gene products. (Blaese, 1993; Hwu et al., 1993; Culver et al., 1991) The fact that individuals mount cellular and humoral responses against tumor associated antigens suggests that identification and characterization of additional tumor antigens is important for immunotherapy of patients with cancer.

T-cell receptors on CD8⁺ T-cells recognize a complex consisting of an antigenic peptide (9-10 amino acids for HLA-A2), β2 microglobulin and class I major histocompatibility complex (MHC) heavy chain (HLA-A, B, C, in humans). Peptides generated by digestion of endogenously synthesized proteins are transported into the endoplastic reticulum, bound to class I MHC heavy chain and β2 microglobulin, and finally expressed in the cell surface in the groove of the class I MHC molecule.

Information on epitopes of self-proteins recognized in the context of MHC Class I molecules remain limited, despite a few attempts to identify epitopes capable of in vitro priming and Ag-specific expansion of human CTLs. For example, peptide epitopes have been proposed which are likely candidates for binding on particular MHC Class I Ag (Falk et al., 1991), and some studies have attempted to define peptide epitopes which bind MHC Class I antigens.

Synthetic peptides have been shown to be a useful tool for T-cell epitope mapping. However in vivo and in vitro priming of specific CTLs has encountered difficulties (Alexander et al., 1991; Schild et al., 1991; Carbone et al., 1988). It is generally considered that in vitro CTL priming cannot necessarily be achieved with peptide alone, and in fact, a high antigen density is thought to be required for peptide priming (Alexander et al., 1991). Even in the limited instances when specific priming was achieved, APC or stimulators were also required at high densities (Alexander et al., 1991).

Short synthetic peptides have been used either as target antigens for epitope mapping or for induction of in vitro primary and secondary CTL responses to viral and parasitic Ags (Bednarek et al., 1991; Gammon et al., 1992; Schmidt et al., 1992; Kos and Müllbacher, 1992; Hill et al., 1992). Unfortunately, these studies failed to show the ability of proto-oncogene peptide analogs to stimulate in vitro human CTLs to lyse tumors endogenously expressing these antigens.

Identification of tumor antigens (Ag) and of specific epitopes on these Ag recognized by cytotoxic T-lymphocytes enables the development of tumor vaccines (for review of tumor antigens, see Rosenberg (2000), incorporated by reference herein). Tumor Ag are weak or partial agonists for activation of low-avidity (low-affinity) CTL. Attempts to activate CTL by increasing the affinity of peptide for MHC (by modifications in the anchor residues) has produced mixed successes even with powerful APC (dendritic cells, DC) and added B7 costimulation. Some of the resulting cross-reactive CTL recognized tumors with lower affinity than CTL induced by wild type Ag.

The limited ability of anchor-fixed immunogens to induce and expand high-affinity CTL raises the need for alternative approaches for CTL induction. One approach to this question is to design immunogens which activate “high-affinity” CTL from the existent pool of responders. In human tumor immunology, this approach has been successful in some instances. However, high-affinity CTL are expected to be more sensitive to silencing by elimination (e.g. apoptosis) or by anergy (unresponsiveness or diminished reactivity to a specific antigen).

These processes occur as a consequence of recurrent stimulations with Ag (tumor Ag) and are amplified by a number of cytokines. The general mechanism of activation induced cell death (AICD) is that repeated stimulations with an Ag in the presence of cytokines such as IL-2 activates cell death pathways. This is because stimulation with Ag and IL-2 transduces a signal which is too strong to induce proliferation and instead leads to premature senescence. An alternative death pathway, passive cell death (PCD) occurs when cytokines involved in survival (IL-2, IL-4, IL-7, etc.) are withdrawn. Since tumor Ag are self-Ag, the corresponding responding cells should be even more sensitive to deletion than CTL responding to foreign Ag, because the body's defense mechanisms are programmed to avoid autoimmunity. There is little known as to how the survival of responders to tumor Ag can be induced, and how they can be protected from AICD or PCD.

Preclinical and clinical trials are underway for the utilization of tumor-specific peptide epitopes for melanoma (Rivoltini et al., 1999; Parkhurst et al., 1998; Kawakami et al., 1998; Lustgarten et al., 1997; Zeng et al., 1997; Reynolds et al., 1998; Nestle et al., 1998; Chakraborty et al., 1998; Rosenberg et al., 1998); breast cancer, such as with MUC1 (Gendler et al., 1998; Xing et al., 1989; Xing et al., 1990; Jerome et al., 1993; Apostolopoulos et al., 1994; Ding et al., 1993; Zhang et al., 1996; Acres et al., 1993; Henderson et al., 1998; Henderson et al., 1996; Samuel et al., 1998; Gong et al., 1997; Apostolopoulos et al., 1995; Pietersz et al., 1998; Lofthouse et al., 1997; Rowse et al., 1998; Gong et al., 1998; Acres et al., 1999; Apostolopoulos et al., 1998; Lees et al., 1999; Xing et al., 1995; Goydos et al., 1996; Reddish et al., 1998; Karanikas et al., 1997), p53 (DeLeo, 1998; McCarty et al., 1998; Hurpin et al., 1998; Gabrilovich et al., 1996), and Her-2/neu (Disis and Cheever, 1998; Ioannides et al., 1993; Fisk et al., 1995; Peoples et al., 1995; Kawashima et al., 1999; Disi et al., 1996); and colon cancer (Kantor et al., 1992; Kantor et al., 1992; Tsang et al., 1995; Hodge et al., 1997; Conry et al., 1998; Kass et al., 1999; Zaremba et al., 1997; Nukaya et al., 1999).

Recently, peptides of folate binding protein (FBP) were recognized by tumor-associated lymphocytes (Peoples et al., 1998; Peoples et al., 1999; Kim et al., 1999). FBP is a membrane-associated glycoprotein originally found as a mAb-defined Ag in placenta and trophoblastic cells but rarely in other normal tissues (Retrig et al., 1985; Elwood, 1989; Weitman et al., 1992; Garin-Chesa et al., 1993). Of interest, this protein has been found in greater than 90% of ovarian and endometrial carcinomas; in 20-50% of breast, colorectal, lung, and renal cell carcinomas; and in multiple other tumor types. When present in cancerous tissue, the level of expression is usually greater than 20-fold normal tissue expression and has been reported to be as high as 80-90-fold in ovarian carcinomas (Li et al., 1996).

U.S. Pat. No. 5,846,538 is directed to immune reactivity to peptides of HER-2/neu protein for treatment of malignancies.

Folate binding protein provides an ideal target for and satisfies a long-felt need in the art for compositions and methods of utilizing the compositions directed to tumor immunity.

SUMMARY OF THE INVENTION

It is an object of the present invention to provide as a composition of matter an antigen comprising a folate binding protein epitope of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 and SEQ ID NO:8.

It is another object of the present invention to provide a composition comprising an antigen which includes a folate binding protein epitope of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof in a pharmaceutically acceptable excipient.

It is another object of the present invention to provide a method for stimulating cytotoxic T-lymphocytes, comprising the step of contacting the cytotoxic T-lymphocytes with an amount of an antigen comprising a folate binding protein epitope selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and a combination thereof, wherein the amount is effective to stimulate the cytotoxic T-lymphocytes. In a specific embodiment of the present invention, the cytotoxic T-lymphocytes are located within a human. In another specific embodiment, the method further comprises the step of administering to the human an antigen comprising a folate binding protein epitope selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and a combination thereof. In another specific embodiment of the present invention, the epitope is formulated for administration parenterally, topically, or as an inhalant, aerosol or spray.

It is an additional object of the present invention to provide a method of generating an immune response, comprising the step of administering to a human a pharmaceutical composition comprising an immunologically effective amount of a composition comprising an antigen comprising a folate binding epitope of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof.

It is another object of the present invention to provide a method of inducing immunity against a tumor in an individual, comprising the steps of administering to the individual an antigen comprising a folate binding protein epitope of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof; and administering to the individual a cancer vaccine. In a specific embodiment of the present invention, the an antigen comprising a folate binding protein epitope is administered prior to the administration of the cancer vaccine. In a specific embodiment of the present invention, an antigen comprising a folate binding protein epitope is administered subsequent to the administration of the cancer vaccine. In another specific embodiment of the present invention, the antigen comprising a folate binding protein epitope is administered both prior to and subsequent to the administration of the cancer vaccine. In a further specific embodiment, the cancer vaccine comprises a polypeptide selected from the group consisting of SEQ ID NO:268 (E39) and SEQ ID NO:269 (E41).

It is another object of the present invention to provide a method of inducing memory cytotoxic T-lymphocytes in an individual comprising the step of administering an antigen comprising a folate binding epitope of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof. In a specific embodiment, the individual is substantially susceptible to recurrence of cancer.

It is another object of the present invention to provide a method of providing immunity against a tumor comprising the step of administering an antigen comprising a folate binding epitope vaccine of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof.

It is another object of the present invention to provide a method of treating an individual for cancer comprising the steps of administering to the individual a first cancer vaccine; and administering to the individual a second cancer vaccine comprising a peptide selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof. In a specific embodiment, the first cancer vaccine administration step precedes the second cancer vaccine administration step. In another specific embodiment, the first cancer vaccine administration step is subsequent to the second cancer vaccine administration step.

It is an additional object of the present invention to provide a pharmaceutical composition comprising an antigen comprising a folate binding protein epitope selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof in a pharmaceutically acceptable excipient.

It is another object of the present invention to provide a method of treating a proliferative cell disorder in a human, comprising administering to the human a therapeutically effective amount of a pharmaceutical composition comprising an antigen comprising a folate binding protein epitope selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or a combination thereof in a pharmaceutically acceptable excipient. In a specific embodiment, the proliferative cell disorder is cancer. In an additional specific embodiment, the cancer is breast cancer, ovarian cancer, endometrial cancer, colorectal cancer, lung cancer, renal cancer, melanoma, kidney cancer, prostate cancer, brain cancer, sarcomas, or a combination thereof.

BRIEF DESCRIPTION OF THE FIGURES

The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.

FIG. 1 demonstrates HLA-A2 stabilization by FBP epitope E39 variants.

FIG. 2A illustrates IFN-γ induction in peripheral blood mononuclear cells (PBMC) with multiple stimulations with J65 or E39.

FIG. 2B illustrates CTL activity in PBMC with multiple stimulations with J65 or E39.

FIG. 3 illustrates specific interleukin 2 (IL-2) induction in PBMCs by priming with E39 variants.

FIG. 4 illustrates expansion of PBMCs stimulated with FBP peptide E39 and its variants.

FIG. 5 demonstrates expansion of PBMC stimulated with variants of the FBP peptide E39.

DETAILED DESCRIPTION OF THE INVENTION I. Definitions

As used herein the specification, “a” or “an” may mean one or more. As used herein in the claim(s), when used in conjunction with the word “comprising”, the words “a” or “an” may mean one or more than one. As used herein “another” may mean at least a second or more.

The term “antigen” as used herein is defined as an entity which elicits an immune system response. The term herein may be abbreviated to “Ag.”

The term “cancer” as used herein is defined as a tissue of uncontrolled growth or proliferation of cells, such as a tumor. In a specific embodiment, the cancer is an epithelial cancer. In specific embodiments, the cancer is breast cancer, ovarian cancer, endometrial cancer, colorectal cancer, lung cancer, renal cancer, melanoma, kidney cancer, prostate cancer, brain cancer, sarcomas, or a combination thereof. In specific embodiments, such cancers in mammals are caused by chromosomal abnormalities, degenerative growth and/or developmental disorders, mitogenic agents, ultraviolet radiation (uv), viral infections, inappropriate tissue expression of a gene, alterations in expression of a gene, carcinogenic agents, or a combination thereof. The term melanoma includes, but is not limited to, melanomas, metastatic melanomas, melanomas derived from either melanocytes or melanocyte related nevus cells, melanocarcinomas, melanoepitheliomas, melanosarcomas, melanoma in situ, superficial spreading melanoma, nodular melanoma, lentigo maligna melanoma, acral lentiginous melanoma, invasive melanoma or familial atypical mole and melanoma (FAM-M) syndrome. The aforementioned cancers can be treated by methods described in the present application.

The term “epitope” as used herein is defined as a short peptide derived from a protein antigen which binds to an MHC molecule and is recognized by a particular T cell.

The term “folate binding protein variant” as used herein is defined as a folate binding protein and peptides thereof which are preferably recognized by helper T cells or cytotoxic T cells and may be naturally derived, synthetically produced, genetically engineered, or a functional equivalent thereof, e.g. where one or more amino acids may be replaced by other amino acid(s) or non-amino acid(s) which do not substantially affect function. In specific embodiments, the peptides are epitopes which contain alterations, modifications, or changes in comparison to SEQ ID NO:268 (E39) or SEQ ID NO:269 (E41). In further specific embodiments, the variants are of SEQ ID NO:1 through SEQ ID NO:8.

The term “immune response” as used herein refers to a cellular immune response, including eliciting stimulation of T lymphocytes, macrophages, and/or natural killer cells.

The term “immunity” as used herein is defined as the ability to provide resistance to a tumor resulting from exposure to an antigen that is a folate binding protein epitope, such as the folate binding protein variants described herein.

The term “vaccine” as used herein is defined as a composition for generating immunity to a cancer. In specific embodiments, the cancer vaccine is a wild-type epitope of folate binding protein, such as E39 (FBP amino acid residues 191-199) (SEQ ID NO:268) or E41 (FBP amino acid residues 245-253) (SEQ ID NO:269). In other specific embodiments, the cancer vaccine comprises SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID NO:8, or a combination thereof. In a preferred embodiment, administration of the vaccine alternates the signaling through the T cell receptor, thereby reducing the possibility of apoptosis.

The term “variant” as used herein is defined as a modified or altered form of a wildtype sequence, such as the folate binding protein E39 epitope (SEQ ID NO:268). The variant may contain replacement of at least one amino acid residue or may contain an altered side chain for at least one amino acid residue.

II. The Present Invention A. Specific Embodiments

The present invention is directed to folate binding protein tumor Ag modified to attenuate the signaling through T cell receptors, compared with a wild-type folate binding protein tumor Ag, particularly for reducing the possibility of apoptosis that results following repeated exposure to strong antigens. Thus, variants of folate binding protein epitopes such as E39 (SEQ ID NO:268) and E41 (SEQ ID NO:269), which are “strong” antigens, are modified to act as a “weak” antigen. Thus, the present invention utilizes compositions and methods to attenuate signaling through the T cell receptors.

The invention works as (1) prestimulation prevaccine, to be administered before the tumor Ag; (2) as post vaccine to be given after the tumor Ag; and/or (3) in certain individuals will work as a priming vaccine. The situations (1) and (2) are more related to a protective role for SEQ ID NO:6 (J65) and its analogs for tumor reactive CTL. The situation (3) can be encountered in certain individuals where mutations in the histocompatibility Ag binding pocket may transform an attenuator into a strong immunogen.

The invention allows protection before and after vaccination of either precursors (stand-in) or activated effectors. In specific embodiments, administration of the variants of folate binding protein provide targeted induction of memory CTL.

The variants described herein, in a particular embodiment SEQ ID NO:6, are intended to attenuate the signaling at recurrent stimulation, thus inducing protection of CTL precursors as of activated T-cells from apoptosis, thereby enabling the immune response to expand, and, in preferred embodiments, have important implications in induction of memory CTL.

It is well known that the two major arms of the immune system are: (1) cell-mediated immunity with immune T cells; and (2) humoral immunity with antibodies. Further, the immune system normally functions to recognize and destroy any foreign or aberrant cells in the body. Since FBP is expressed by some normal cells, tolerance and/or anergy is expected.

Development of molecular therapies for cancer have historically focused on specific recognition of Ags by cellular immune effectors. The present invention discloses novel strategies aimed at identification of peptide targets for CTLs, and generation of T-cell immunity against specific epitopes (for a review of T-cell specific immunity, see, e.g., Ioannides et al., 1992; Houbiers et al., 1993).

To achieve this, the present invention provides novel naturally- and synthetically-derived peptides which bind human leucocyte antigen-(HLA) class I heavy chains. Appropriate criteria for epitope selection in vitro have been defined, and synthetic peptides based on immunogenic epitopes of FBP have also been produced.

Although the dominant anchors for peptide binding to HLA-A2 are Leu (P2) and Val (P9), a number of residues with similar charge and side chains, such as Ile and/or Met, were identified in CTL epitopes from viral proteins (Falk et al., 1991; Bednarek et at, 1991).

B. General Embodiments

1. CTL Epitopes

CTL epitopes reported to date are mainly derived from foreign (viral) proteins with little or no homology to self-proteins. With respect to CTL responses to self-proteins, it is expected that T-cells expressing TCR with high affinity for self-peptide-MHC class I complexes are eliminated in the thymus during development. Self-peptides eluted from HLA-A2.1 molecules of various cell lines show residues at P3-P5 and P7-P8 which are different from the sequences of viral epitopes recognized by human CTLs. Since these residues are likely to contact and interact with TCR, they may reflect peptides for which autologous T-cells are already tolerant/anergic.

For T-cell recognizing self-epitopes to be eliminated or anergized, a precondition exists that the peptide-MHC complex is stable enough to engage a sufficient number of TCRs, or at least more stable than other HLA-A2 peptide complexes, where one peptide can be easily displaced by other peptides. Consequently, this would suggest that for self-proteins with extension to FBP, the ones that can bind TCR with high affinity during development will be less likely to be recognized later when expressed on a tumor other target, than peptides that bind HLA-A2 with low affinity, which under appropriate conditions (e.g., high protein concentration) may occupy a higher number of HLA-A2 molecules. For low-affinity peptides, modification of the anchors resulting in stabilization of peptide—HLA-A2 interaction by replacing weak with dominant anchor residues (e.g., (P9) M

V, should facilitate the reactivity of CTL with targets expressing such antigens, because TCR interacts mainly with the sequence P4-P8.

Tumor progression and metastasis are often associated with overexpression of specific cellular proteins. Epitopes of non-mutated overexpressed proteins can be targets of a specific cellular immune response against tumor mediated by T-cells. Moreover, when T-cell epitopes are present, distinction between tumor immunity/autoimmunity and unresponsiveness can be predicated on the protein concentration as a limiting factor of epitope supply.

2. Epitopic Core Sequences

The present invention is also directed to protein or peptide compositions, free from total cells and other peptides, which comprise a purified protein or peptide which incorporates an epitope that is immunologically recognized by a CTL.

As used herein, the term “incorporating an epitope(s) that is immunologically recognized by a CTL” is intended to refer to a peptide or protein antigen which includes a primary, secondary or tertiary structure similar to an epitope located within a FBP polypeptide. The level of similarity will generally be to such a degree that the same population of CTLs will also bind to, react with, or otherwise recognize, the cross-reactive peptide or protein antigen.

The identification of CTL-stimulating immunodominant epitopes, and/or their functional equivalents, suitable for use in vaccines is a relatively straightforward matter. For example, one may employ the methods of Hopp, as taught in U.S. Pat. No. 4,554,101, incorporated herein by reference, which teaches the identification and preparation of epitopes from amino acid sequences on the basis of hydrophilicity. The methods described in several other papers, and software programs based thereon, can also be used to identify epitopic core sequences (see, for example, Jameson and Wolf, 1988; Wolf et al., 1988; U.S. Pat. No. 4,554,101). The amino acid sequence of these “epitopic core sequences” may then be readily incorporated into peptides, either through the application of peptide synthesis or recombinant technology.

Preferred peptides for use in accordance with the present invention will generally be on the order of 8 to 20 amino acids in length, and more preferably about 8 to about 15 amino acids in length. It is proposed that shorter antigenic CTL-stimulating peptides will provide advantages in certain circumstances, for example, in the preparation of vaccines or in immunologic detection assays. Exemplary advantages include the ease of preparation and purification, the relatively low cost and improved reproducibility of production, and advantageous biodistribution.

It is proposed that particular advantages of the present invention may be realized through the preparation of synthetic peptides which include modified and/or extended epitopic/immunogenic core sequences which result in a “universal” epitopic peptide directed to FBP sequences. These epitopic core sequences are identified herein in particular aspects as hydrophilic regions of the FBP polypeptide antigen. It is proposed that these regions represent those which are most likely to promote T-cell or B-cell stimulation, and, hence, elicit specific antibody production.

An epitopic core sequence, as used herein, is a relatively short stretch of amino acids that is “complementary” to, and therefore will bind, receptors on CTLs. It will be understood that in the context of the present disclosure, the term “complementary” refers to amino acids or peptides that exhibit an attractive force towards each other.

In general, the size of the polypeptide antigen is not believed to be particularly crucial, so long as it is at least large enough to carry the identified core sequence or sequences. The smallest useful core sequence anticipated by the present disclosure would generally be on the order of about 8 amino acids in length, with sequences on the order of 9 or 10 being more preferred. Thus, this size will generally correspond to the smallest peptide antigens prepared in accordance with the invention. However, the size of the antigen may be larger where desired, so long as it contains a basic epitopic core sequence.

A skilled artisan recognizes that numerous computer programs are available for use in predicting antigenic portions of proteins (see e.g., Jameson & Wolf, 1988; Wolf et al., 1988). Computerized peptide sequence analysis programs (e.g., DNAStar Software, DNAStar, Inc., Madison, Wis.) may also be useful in designing synthetic peptides in accordance with the present disclosure.

Syntheses of epitopic sequences, or peptides which include an antigenic epitope within their sequence, are readily achieved using conventional synthetic techniques such as the solid phase method (e.g., through the use of commercially available peptide synthesizer such as an Applied Biosystems Model 430A Peptide Synthesizer). Peptide antigens synthesized in this manner may then be aliquoted in predetermined amounts and stored in conventional manners, such as in aqueous solutions or, even more preferably, in a powder or lyophilized state pending use.

In general, due to the relative stability of peptides, they may be readily stored in aqueous solutions for fairly long periods of time if desired, e.g., up to six months or more, in virtually any aqueous solution without appreciable degradation or loss of antigenic activity. However, where extended aqueous storage is contemplated it will generally be desirable to include agents including buffers such as Tris or phosphate buffers to maintain a pH of about 7.0 to about 7.5. Moreover, it may be desirable to include agents which will inhibit microbial growth, such as sodium azide or Merthiolate. For extended storage in an aqueous state it will be desirable to store the solutions at 4° C., or more preferably, frozen. Of course, where the peptides are stored in a lyophilized or powdered state, they may be stored virtually indefinitely, e.g., in metered aliquots that may be rehydrated with a predetermined amount of water (preferably distilled) or buffer prior to use.

3. T lymphocytes

T lymphocytes recognize antigen in the form of peptide fragments that are bound to class I and class II molecules of the major histocompatibility complex (MHC) locus. Major Histocompatibility Complex (MHC) is a generic designation meant to encompass the histocompatibility antigen systems described in different species including the human leucocyte antigens (HLA). The T-cell receptor for antigen (TCR) is a complex of at least 8 polypeptide chains. (“Basic and Clinical Immunology” (1994) Stites, Terr and Parslow (eds) Appleton and Lange, Nenmack Conn.) Two of these chains (the alpha and beta chains) form a disulfide-linked dimer that recognizes antigenic peptides bound to MHC molecules and therefore is the actual ligand-binding structure within the TCR. The TCR alpha and beta chains are similar in many respects to immunoglobulin proteins. The amino-terminal regions of the alpha and beta chains are highly polymorphic, so that within the entire T-cell population there are a large number of different TCR alpha/beta dimers, each capable of recognizing or binding a particular combination of antigenic peptide and MHC.

In general, CD4⁺ T cell populations are considered to function as helpers/inducers through the release of lymphokines when stimulated by a specific antigen; however, a subset of CD4⁺ cells can act as cytotoxic T lymphocytes (CTL). Similarly, CD8⁺ T cells are considered to function by directly lysing antigenic targets; however, under a variety of circumstances they can secrete lymphokines to provide helper or DTH function. Despite the potential of overlapping function, the phenotypic CD4 and CD8 markers are linked to the recognition of peptides bound to class II or class I MHC antigens. The recognition of antigen in the context of class II or class I MHC mandates that CD4⁺ and CD8⁺ T cells respond to different antigens or the same antigen presented under different circumstances. The binding of immunogenic peptides to class II MHC antigens most commonly occurs for antigens ingested by antigen presenting cells. Therefore, CD4⁺ T cells generally recognize antigens that have been external to the tumor cells. By contrast, under normal circumstances, binding of peptides to class I MHC occurs only for proteins present in the cytosol and synthesized by the target itself, proteins in the external environment are excluded. An exception to this is the binding of exogenous peptides with a precise class I binding motif which are present outside the cell in high concentration. Thus, CD4⁺ and CD8⁺ T cells have broadly different functions and tend to recognize different antigens as a reflection of where the antigens normally reside.

As disclosed within the present invention, the protein product expressed by FBP is recognized by T cells. Such a protein expression product “turns over” within cells, i.e., undergoes a cycle wherein a synthesized protein functions and then eventually is degraded and replaced by a newly synthesized molecule. During the protein life cycle, peptide fragments from the protein bind to major histocompatibility complex (MHC) antigens. By display of a peptide bound to MHC antigen on the cell surface and recognition by host T cells of the combination of peptide plus self MHC antigen, a malignant cell will be immunogenic to T cells. The exquisite specificity of the T cell receptor enables individual T cells to discriminate between protein fragments which differ by a single amino acid residue.

During the immune response to a peptide, T cells expressing a T cell receptor with high affinity binding of the peptide-MHC complex will bind to the peptide-MHC complex and thereby become activated and induced to proliferate. In the first encounter with a peptide, small numbers of immune T cells will secrete lymphokines, proliferate and differentiate into effector and memory T cells. Subsequent encounters with the same antigen by the memory T cell will lead to a faster and more intense immune response.

Intact folate binding protein or peptides thereof which are recognized by cytotoxic T cells may be used within the present invention. The peptides may be naturally derived or produced based upon an identified sequence. The peptides for CD8⁺ T cell responses (elicited by peptides presented by folate binding protein class I MHC molecules) are generally about 8-10 amino acids in length. Peptides for CD8⁺ T cell responses vary according to each individual's class I MHC molecules. Examples of peptides suitable within the present invention for CD8⁺ T cell responses include peptides comprising or consisting of SEQ ID NO:1 through SEQ ID NO:8.

It will be evident to those of ordinary skill in the art that other peptides may be produced for use within the present invention, both for class I MHC molecules as well as for class II molecules. A variety of techniques are well known for isolating or constructing peptides. Suitable peptides are readily identified based upon the disclosure provided herein. Additional suitable peptides include those which are longer in length. Such peptides may be extended (e.g., by the addition of one or more amino acid residues and/or truncated (e.g., by the deletion of one or more amino acid residues from the carboxyl terminus). Alternatively, suitable peptides may be variations on other preferred peptides disclosed herein. Although this particular peptide variation may result in a peptide with the same number of total amino acids (such as nine), a peptide variation on a preferred peptide need not be identical in length. Variations in amino acid sequence that yield peptides having substantially the same desired biological activity are within the scope of the present invention.

Immunization of an individual with a FBP peptide (i.e., as a vaccine) can induce continued expansion in the number of T cells necessary for therapeutic attack against a tumor in which FBP is associated. Typically, about 0.01 μg/kg to about 100 mg/kg body weight will be administered by the intradermal, subcutaneous or intravenous route. A preferred dosage is about 1 μg/kg to about 1 mg/kg, with about 5 μg/kg to about 200 μg/kg particularly preferred. It will be evident to those skilled in the art that the number and frequency of administrations will be dependent upon the response of the patient. It may be desirable to administer the FBP peptide repetitively. It will be evident to those skilled in this art that more than one FBP peptide may be administered, either simultaneously or sequentially. For example, a combination of about 8-15 peptides may be used for immunization. Preferred peptides for immunization are those that include all or a portion of at least one FBP amino acid SEQ ID NO:1 through SEQ ID NO:68, or variants thereof. One or more peptides from other portions of the amino acid sequence shown in SEQ ID NO:1 through SEQ ID NO:68 may be added to one or more of the preferred peptides.

In addition to the FBP peptide (which functions as an antigen), it may be desirable to include other components in the vaccine, such as a vehicle for antigen delivery and immunostimulatory substances designed to enhance the protein's immunogenicity. Examples of vehicles for antigen delivery include aluminum salts, water-in-oil emulsions, biodegradable oil vehicles, oil-in-water emulsions, biodegradable microcapsules, and liposomes. Examples of immunostimulatory substances (adjuvants) include N-acetylmuramyl-L-alanine-D-isoglutamine (MDP), lipopoly-saccharides (LPS), glucan, IL-12, GM-CSF, gamma interferon and IL-15. It will be evident to those skilled in this art that a FBP peptide may be prepared synthetically or that a portion of the protein (naturally-derived or synthetic) may be used. When a peptide is used without additional sequences, it may be desirable to couple the peptide hapten to a carrier substance, such as keyhole limpet hemocyanin.

The methods and compositions of the present invention are particularly well-suited for inducing an immune response in a patient who has developed resistance to conventional cancer treatments or who has a high probability of developing a recurrence following treatment. A skilled artisan recognizes that cancer cells are able to evade the immune system or evade an effective immune response because they look like self, they actively anergize the immune system to any antigens which may potentially differentiate between self and tumor, and they may create an immunosuppressive environment by secreting immunosuppressive factors and/or by expressing factors which can induce apoptosis of an offensive tumor antigen-specific killer cell.

A skilled artisan is aware of multiple reviews concerning cancer vaccines and the generation of cellular immune responses to antigenic tumor peptides (Pietersz et al., 2000; Pardoll, 2000; Rosenberg, 2000; Dalgleish, 2000, each of which are incorporated by reference herein).

A skilled artisan recognizes that the antigen can be produced in large amounts by recombinant technology, either as soluble molecules in eukaryotic systems or as fusion proteins in bacterial systems. In a specific embodiment, synthetic peptides are made from the tumor antigen. Furthermore, monoclonal antibodies to the tumor antigens are useful in their identification and purification.

In a peptide approach to tumor immunotherapy, peptides (such as about 8-9mers) are presented by MHC class I molecules, leading to the generation of CD8⁺-mediated cellular responses comprising CTLs and cytokine secretion, mostly in the form of IFN-γ and TNF-α.

A skilled artisan recognizes that the dendritic cell is important in generating CD8⁺ CTLs following class I presentation. Esche et al. (1999) demonstrated techniques whereby dendritic cells are obtained from patients, isolated, expanded in vitro, exposed to the peptides and reintroduced into the patient. Others utilize similarly treated dendritic cells for generation of specifically activated T cells in vitro before transfer.

A crucial initial step in CD8⁺ T cell generation is the uptake and presentation of peptides by MHC molecules by antigen-presenting cells. MHC class I proteins consist of three subunits, all of which are important for the formation of a stable complex. X-ray crystallography of MHC class I molecules has demonstrated that interaction of peptides with the MHC class I groove is determined by the peptide sequence, with discrete amino acids interacting with pockets in the MHC groove (which have a fixed spacing from each other) and also have specificity for anchoring amino acid side chains. Although there are exceptions, the amino and carboxy termini of the peptides are anchored at either end of the groove, often in positions 2 or 3, 5 or 7 (Apostolopoulos et al., 1997a; Apostolopoulos et al., 1997b). The peptides also interact with the T cell receptor, yet only a small amount of the peptide is exposed (Apostolopoulos et al., 1998).

Given that multiple peptide tumor antigens, such as folate binding protein, have been identified in addition to characterization of T cell epitopes, in a specific embodiment of the present invention peptide antigens are generated synthetically for immunization. The immunogenicity of small peptides can be improved upon by increasing the peptide size, by binding to carriers and also by using adjuvants to activate macrophages and other immune system factors. A skilled artisan is cognizant of recombinant cytokines being used to increase immunogenicity of a synthetic peptide (Tao and Levy, 1993) and furthermore that cytokines can also be directly fused to peptides (Nakao et al., 1994; Disis et al., 1996; Chen et al., 1994).

In specific embodiments of the present invention, mixtures of separate peptides are administered as a vaccine. Alternatively, multiple epitopes may be incorporated into the same molecule by recombinant technology well known in the art (Mateo et al., 1999; Astori and Krachenbuhl, 1996). In another embodiment, a combinatorial peptide library is used to increase binding peptides by utilizing different amino acids at least one anchor location.

In another embodiment of the present invention, natural amino acids of a peptide are replaced with unnatural D-amino acids; alternatively, the peptide residues are assembled in reverse order, which renders the peptides resistant to proteases (Briand et al., 1997; Herve et al., 1997; Bartnes et al., 1997; Guichard et al., 1996). In another embodiment, unnatural modified amino acids are incorporated into a peptide, such as α-aminoisobutyric acid or N-methylserine.

A skilled artisan recognizes that the binding strength of the 8- or 9-mer to the MHC complex and the subsequent recognition by the T cell receptor determines the immunogenicity of CTL peptides. Van Der Burg et al. (1993) determined that the longer the peptide remains bound to the MHC complex, the better the chance it will induce a T cell response. A skilled artisan also recognizes that there are methods for introducing extraneous peptides directly into the cytoplasm of a cell to allow generation of class I-restricted cellular immune responses. One example includes microbial toxins, which can carry peptides in their cytoplasm for delivery because they enter cells by receptor-mediated endocytosis and thereby deposit cellular toxins into the cytoplasm. Specific examples include shiga toxin (Lee et al., 1998), anthrax toxin (Goletz et al., 1997), diphtheria toxin (Stenmark et al., 1991), Pseudomonas exotoxin (Donnelly et al., 1993), and Bordetella pertussis toxin (Fayolle et al., 1996).

In alternative embodiments, peptides enter cells through membrane fusion and are beneficial for delivering tumor or other peptides into a cell cytoplasm, including Antennapedia (Derossi et al., 1994; Derossi et al., 1996; Schutze-Redelmeier et al., 1996), Tat protein (Kim et al., 1997), and Measles virus fusion peptide (Partidos et al., 1997).

In other embodiments, peptides are introduced into a cytoplasm through lipopeptides, which comprise both a lipid and a peptide, by direct insertion into the lipophilic cell membrane (BenMohamed et al., 1997; Obert et al., 1998; Deprez et al., 1996; Beekman et al., 1997). In alternative embodiments, the peptides are delivered in liposomes (for examples, see Nakanishi et al., 1997; Noguchi et al., 1991; Fukasawa et al., 1998; Guan et al., 1998), whereby the immunogenicity is dependent on the size, charge, lipid composition of the liposome itself, and whether or not the antigen is present on the surface of the liposome or within its interior.

A skilled artisan also recognizes that immune-stimulating complexes (ISCOMs), which comprise Quill A (a mixture of saponins), cholesterol, phospholipid, and proteins, are useful for delivering naturally hydrophobic antigens or antigens made hydrophobic by the addition of myristic or palmitic acid tails (for examples, see Hsu et al., 1996; Sjolander et al., 1997; Villacres-Eriksson, 1995; Tarpey et al., 1996; Rimmelzwaan et al., 1997). ISCOMs facilitate penetration into cells by fusion with their membranes, by endocytosis, or by phagocytosis.

Antigens may also be directed to particular subcellular compartments through incorporation of sorting signals to the antigen by recombinant technology, including Class II LAMP-I (Rowell et al., 1995; Wu et al., 1995), ER targeting peptide (Minev et al., 1994); CLIP (Malcherik et al., 1998), and heat shock proteins (Udono and Srivastava, 1993; Heike et al., 1996; Zhu et al., 1996; Suzue et al., 1997; Ciupitu et al., 1998).

A skilled artisan recognizes that the present invention provides anti-cancer therapeutic compositions comprising a variety of peptides designated for CD8⁺ T cell responses comprising SEQ ID NO:1 through SEQ ID NO:8, or a combination thereof. A skilled artisan also recognizes that the present invention provides anti-cancer therapeutic compositions comprising a variety of peptides designated for CD8⁺ T cell responses consisting essentially of SEQ ID NO:1 through SEQ ID NO:8, or a combination thereof.

A skilled artisan recognizes that references such as Abrams and Schlom (2000) summarize the current views on rational Ag modification. Two types of peptides are described: (1) agonistic peptides which upregulate Ag-specific responses; (2) antagonistic/partial agonistic peptides which downregulate the same responses. However, it is an object of the present invention to provide therapy which stimulate Ag-specific immune responses while at the same time does not elicit activation induced-cell death or death by neglect.

A skilled artisan recognizes that sequences that encode folate binding protein epitopes for induction of tumor immunity can be obtained from databases such as the National Center for Biotechnology Informations's GenBank® database or commercially available databases, such as that of Celera Genomics, Inc. (Rockville, Md.). Examples of folate binding protein sequences which may comprise an epitope or which can be altered to comprise an epitope include the following, denoted by GenBank® Accession numbers: P14207 (SEQ ID NO:9); P15328 (SEQ ID NO:10); P13255 (SEQ ID NO:11); NP_(—)000793 (SEQ ID NO:12); AAB05827 (SEQ ID NO:13); AAG36877 (SEQ ID NO:14); S42627 (SEQ ID NO:15); S00112 (SEQ ID NO:16); BFBO (SEQ ID NO:17); S62670 (SEQ ID NO:18); S62669 (SEQ ID NO:19); A55968 (SEQ ID NO:20); A45753 (SEQ ID NO:21); A33417 (SEQ ID NO:22); B40969 (SEQ ID NO:23); A40969 (SEQ ID NO:24); NP_(—)057943 (SEQ ID NO:25); NP_(—)057942 (SEQ ID NO:26); NP_(—)057941 (SEQ ID NO:27); NP_(—)057937 (SEQ ID NO:28); NP_(—)057936 (SEQ ID NO:29); NP_(—)037439 (SEQ ID NO:30); NP_(—)032061 (SEQ ID NO:31); NP_(—)032060 (SEQ ID NO:32); NP_(—)000795 (SEQ ID NO:33); NP_(—)000794 (SEQ ID NO:34); AAF66225 (SEQ ID NO:35); BAA37125 (SEQ ID NO:36); P02752 (SEQ ID NO:37); Q05685 (SEQ ID NO:38); P35846 (SEQ ID NO:39); P02702 (SEQ ID NO:40); AAD53001 (SEQ ID NO:41); AAD33741 (SEQ ID NO:42); AAD33740 (SEQ ID NO:43); AAD19354 (SEQ ID NO:44); AAD19353 (SEQ ID NO:45); AAC98303 (SEQ ID NO:46); AAB81938 (SEQ ID NO:47); AAB81937 (SEQ ID NO:48); AAB49703 (SEQ ID NO:49); AAB35932 (SEQ ID NO:50); 1011184A (SEQ ID NO:51); 0908212A (SEQ ID NO:52); CAA44610 (SEQ ID NO:53); CAA83553 (SEQ ID NO:54); AAA74896 (SEQ ID NO:55); AAA49056 (SEQ ID NO:56); AAA37599 (SEQ ID NO:57); AAA37598 (SEQ ID NO:58); AAA37597 (SEQ ID NO:59); AAA37594 (SEQ ID NO:60); AAA37596 (SEQ ID NO:61); AAA37595 (SEQ ID NO:62); AAA35824 (SEQ ID NO:63); AAA35823 (SEQ ID NO:64); AAA35822 (SEQ ID NO:65); AAA35821 (SEQ ID NO:66); AAA18382 (SEQ ID NO:67); and AAA17370 (SEQ ID NO:68).

A skilled artisan also recognizes that epitopes of folate binding protein, nucleic acid sequences are encoded by, or altered to encode a variant of, for example, one of the following: U02715 (SEQ ID NO:69); BE518506 (SEQ ID NO:70); BG058247 (SEQ ID NO:71); BG017460 (SEQ ID NO:72); NM_(—)000802 (SEQ ID NO:73); U20391 (SEQ ID NO:74); NM_(—)016731 (SEQ ID NO:75); NM_(—)016730 (SEQ ID NO:76); NM_(—)016729 (SEQ ID NO:77); NM_(—)016725 (SEQ ID NO:78); NM_(—)016724 (SEQ ID NO:79); NM_(—)013307 (SEQ ID NO:80); NM_(—)008035 (SEQ ID NO:81); NM_(—)008034 (SEQ ID NO:82); BF153292 (SEQ ID NO:83); BF114518 (SEQ ID NO:84); BE940806 (SEQ ID NO:85); BE858996 (SEQ ID NO:86); AF219906 (SEQ ID NO:87); AF219905 (SEQ ID NO:88); AF219904 (SEQ ID NO:89); BE687177 (SEQ ID NO:90); BE636622 (SEQ ID NO:91); BE627230 (SEQ ID NO:92); BE506561 (SEQ ID NO:93); BE505048 (SEQ ID NO:94); BE496754 (SEQ ID NO:95); BB114010 (SEQ ID NO:96); BB109527 (SEQ ID NO:97); BB107219 (SEQ ID NO:98); BE206324 (SEQ ID NO:99); BE448392 (SEQ ID NO:100); BE207596 (SEQ ID NO:101); BE206635 (SEQ ID NO:102); BE240998 (SEQ ID NO:103); BE228221 (SEQ ID NO:104); BE225416 (SEQ ID NO:105); BE225404 (SEQ ID NO:106); BB214040 (SEQ ID NO:107); BE199619 (SEQ ID NO:108); BE199597 (SEQ ID NO:109); BE198610 (SEQ ID NO:110); BE198571 (SEQ ID NO:111); BE188055 (SEQ ID NO:112); BE187804 (SEQ ID NO:113); BB032646 (SEQ ID NO:114); BE037278 (SEQ ID NO:115); BE037125 (SEQ ID NO:116); BE037110 (SEQ ID NO:117); BE037009 (SEQ ID NO:118); BE036024 (SEQ ID NO:119); BE035828 (SEQ ID NO:120); BE035751 (SEQ ID NO:121); BE019724 (SEQ ID NO:122); AW913291 (SEQ ID NO:123); AW912445 (SEQ ID NO:124); AW823912 (SEQ ID NO:125); AW823418 (SEQ ID NO:126); AB023803 (SEQ ID NO:127); AB022344 (SEQ ID NO:128); AW475385 (SEQ ID NO:129); AW323586 (SEQ ID NO:130); AW319308 (SEQ ID NO:131); AW239668 (SEQ ID NO:132); AV253136 (SEQ ID NO:133); AW013716 (SEQ ID NO:134); AW013704 (SEQ ID NO:135); AW013702 (SEQ ID NO:136); AW013696 (SEQ ID NO:137); AW013669 (SEQ ID NO:138); AW013647 (SEQ ID NO:139); AW013501 (SEQ ID NO:140); AW013484 (SEQ ID NO:141); AW013428 (SEQ ID NO:142); AW013404 (SEQ ID NO:143); AW013386 (SEQ ID NO:144); AW013284 (SEQ ID NO:145); AW013183 (SEQ ID NO:146); AF061256 (SEQ ID NO:147); AI956572 (SEQ ID NO:148); AI882550 (SEQ ID NO:149); AI822932 (SEQ ID NO:150); AI785988 (SEQ ID NO:151); AI744273 (SEQ ID NO:152); AI727302 (SEQ ID NO:153); AI725714 (SEQ ID NO:154); AF137375 (SEQ ID NO:155); AF137374 (SEQ ID NO:156); AF137373 (SEQ ID NO:157); AF096320 (SEQ ID NO:158); AF096319 (SEQ ID NO:159); AI663857 (SEQ ID NO:160); AI647841 (SEQ ID NO:161); AI646950 (SEQ ID NO:162); AI607910 (SEQ ID NO:163); AI529173 (SEQ ID NO:164); AI509734 (SEQ ID NO:165); AI506267 (SEQ ID NO:166); AI498269 (SEQ ID NO:167); AI000444 (SEQ ID NO:168); AA956337 (SEQ ID NO:169); AA955042 (SEQ ID NO:170); AA899838 (SEQ ID NO:171); AA899718 (SEQ ID NO:172); AA858756 (SEQ ID NO:173); AI311561 (SEQ ID NO:174); AI385951 (SEQ ID NO:175); AI352406 (SEQ ID NO:176); AF100161 (SEQ ID NO:177); AI326503 (SEQ ID NO:178); AI325517 (SEQ ID NO:179); AI325453 (SEQ ID NO:180); AI325382 (SEQ ID NO:181); AI323700 (SEQ ID NO:182); AI323374 (SEQ ID NO:183); AI313973 (SEQ ID NO:184); AI196928 (SEQ ID NO:185); AF091041 (SEQ ID NO:186); AI156212 (SEQ ID NO:187); AI120374 (SEQ ID NO:188); AI119000 (SEQ ID NO:189); AA408670 (SEQ ID NO:190); AA408072 (SEQ ID NO:191); AA407615 (SEQ ID NO:192); AA995272 (SEQ ID NO:193); C78593 (SEQ ID NO:194); AA999910 (SEQ ID NO:195); AA991491 (SEQ ID NO:196); X99994 (SEQ ID NO:197); X99993 (SEQ ID NO:198); X99992 (SEQ ID NO:199); X99991 (SEQ ID NO:200); X99990 (SEQ ID NO:201); AA958985 (SEQ ID NO:202); AA873222 (SEQ ID NO:203); AA930051 (SEQ ID NO:204); AA895334 (SEQ ID NO:205); AA796142 (SEQ ID NO:206); AA798223 (SEQ ID NO:207); AA734325 (SEQ ID NO:208); AA690871 (SEQ ID NO:209); AA674988 (SEQ ID NO:210); AA674863 (SEQ ID NO:211); AA674821 (SEQ ID NO:212); AA674744 (SEQ ID NO:213); AA671558 (SEQ ID NO:214); AF000381 (SEQ ID NO:215); AF000380 (SEQ ID NO:216); AA637071 (SEQ ID NO:217); AA616314 (SEQ ID NO:218); AA109687 (SEQ ID NO:219); AA608235 (SEQ ID NO:220); AA589050 (SEQ ID NO:221); AA544782 (SEQ ID NO:222); AA522095 (SEQ ID NO:223); AA386821 (SEQ ID NO:224); AA386818 (SEQ ID NO:225); AA386495 (SEQ ID NO:226); AA289278 (SEQ ID NO:227); AA286342 (SEQ ID NO:228); AA276302 (SEQ ID NO:229); AA276123 (SEQ ID NO:230); AA277280 (SEQ ID NO:231); AA273543 (SEQ ID NO:232); U89949 (SEQ ID NO:233); AA208306 (SEQ ID NO:234); AA208089 (SEQ ID NO:235); AA242285 (SEQ ID NO:236); AA139715 (SEQ ID NO:237); AA139709 (SEQ ID NO:238); AA139675 (SEQ ID NO:239); AA139593 (SEQ ID NO:240); AA124010 (SEQ ID NO:241); AA108790 (SEQ ID NO:242); AA108350 (SEQ ID NO:243); AA028831 (SEQ ID NO:244); AA061275 (SEQ ID NO:245); W82933 (SEQ ID NO: 246); AA015571 (SEQ ID NO:247); W71715 (SEQ ID NO:248); W59165 (SEQ ID NO:249); X62753 (SEQ ID NO:250); Z32564 (SEQ ID NO:251); T29279 (SEQ ID NO:252); M25317 (SEQ ID NO:253); M86438 (SEQ ID NO:254); J03922 (SEQ ID NO:255); M64817 (SEQ ID NO:256); L25338 (SEQ ID NO:257); M97701 (SEQ ID NO:258); M97700 (SEQ ID NO:259); M64782 (SEQ ID NO:260); M35069 (SEQ ID NO:261); J05013 (SEQ ID NO:262); M28099 (SEQ ID NO:263); J02876 (SEQ ID NO:264); U08471 (SEQ ID NO:265); U02714 (SEQ ID NO:266); and U02716 (SEQ ID NO:267).

A skilled artisan also recognizes that the scope of the invention is not limited to the specific nonapeptides described in SEQ ID NO:1 through SEQ ID NO:8. The antigens comprising a FBP epitope may be at least about 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, or up to about 30. It is contemplated that any amino acid may be used for additions or filling in for the remainder of sequences in addition to the specific variant sequence provided herein. However, it is preferred that they will be those that will maintain the underlying sequence of FBP.

III. Rational Vaccine Design

The goal of rational vaccine design is to produce structural analogs of biologically active compounds. By creating such analogs, it is possible to fashion vaccines which are more active or stable than the natural molecules, which have different susceptibility to alteration or which may affect the function of various other molecules. In one approach, a skilled artisan generates a three-dimensional structure for the folate binding protein variant of the invention or a fragment thereof. This could be accomplished by X-ray crystallography, computer modeling, or by a combination of both approaches. An alternative approach involves the random replacement of functional groups throughout the folate binding protein variant, and the resulting affect on function is determined.

It also is possible to isolate a folate binding protein variant specific antibody, selected by a functional assay, and then solve its crystal structure. In principle, this approach yields a pharmacore upon which subsequent vaccine design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic antibodies to a functional, pharmacologically active antibody. As a mirror image of a mirror image, the binding site of anti-idiotype would be expected to be an analog of the original antigen. The anti-idiotype could then be used to identify and isolate peptides from banks of chemically- or biologically-produced peptides. Selected peptides would then serve as the vaccine.

Thus, one may design vaccines which have enhanced and improved biological activity, for example, anti-tumor activity, relative to a starting folate binding protein variant of the invention. By virtue of standard chemical isolation procedures and other descriptions herein, sufficient amounts of the folate binding protein variants of the invention can be produced to perform crystallographic studies. In addition, knowledge of the chemical characteristics of these compounds permits computer-employed predictions of structure-function relationships.

IV. Immunological Reagents

It is well known in the art that the immunogenicity of a particular immunogen composition can be enhanced by the use of non-specific stimulators of the immune response, known as adjuvants. Suitable adjuvants include all acceptable immunostimulatory compounds, such as cytokines, chemokines, cofactors, toxins, plasmodia, synthetic compositions or LEEs or CEEs encoding such adjuvants.

Adjuvants that may be used include IL-1, IL-2, IL-4, IL-7, IL-12, γ-interferon, GMCSP, BCG, aluminum hydroxide, MDP compounds, such as thur-MDP and nor-MDP, CGP (MTP-PE), lipid A, and monophosphoryl lipid A (MPL). RIBI, which contains three components extracted from bacteria, MPL, trehalose dimycolate (TDM) and cell wall skeleton (CWS) in a 2% squalene/Tween 80 emulsion is also contemplated. MHC antigens may even be used. Exemplary, often preferred adjuvants include complete Freund's adjuvant (a non-specific stimulator of the immune response containing killed Mycobacterium tuberculosis), incomplete Freund's adjuvants and aluminum hydroxide adjuvant.

In addition to adjuvants, it may be desirable to coadminister biologic response modifiers (BRM), which have been shown to upregulate T cell immunity or down-regulate suppressor cell activity. Such BRMs include, but are not limited to, Cimetidine (CIM; 1200 mg/d) (Smith/Kline, PA); low-dose Cyclophosphamide (CYP; 300 mg/m2) (Johnson/Mead, NJ), cytokines such as g-interferon, IL-2, or IL-12 or genes encoding proteins involved in immune helper functions, such as B-7.

A variety of routes can be used to administer the vaccines including but not limited to subcutaneous, intramuscular, intradermal, intraepidermal, intravenous and intraperitoneal.

An individual, such as a patient, is injected with vaccine generally as described above. The antigen may be mixed with adjuvant, such as Freund's complete or incomplete adjuvant. Booster administrations with the same vaccine or DNA encoding the same may occur at approximately two-week intervals.

V. Immunotherapeutic Agents

An immunotherapeutic agent generally relies on the use of immune effector cells and molecules to target and destroy cancer cells. The immune effector may be, for example, a folate binding protein variant which is or is similar to a tumor cell antigen. The variant alone may serve as an effector of therapy or it may recruit other cells to actually effect cell killing. The variant also may be conjugated to a drug or toxin (e.g., a chemotherapeutic, a radionuclide, a ricin A chain, a cholera toxin, a pertussis toxin, etc.) and serve merely as a targeting agent. Such antibody conjugates are called immunotoxins, and are well known in the art (see U.S. Pat. No. 5,686,072, U.S. Pat. No. 5,578,706, U.S. Pat. No. 4,792,447, U.S. Pat. No. 5,045,451, U.S. Pat. No. 4,664,911, and U.S. Pat. No. 5,767,072, each incorporated herein by reference). Alternatively, the effector may be a lymphocyte carrying a surface molecule that interacts, either directly or indirectly, with a tumor cell target. Various effector cells include cytotoxic T cells and NK cells.

In one aspect of immunotherapy, the tumor cell must bear some marker that is amenable to targeting, i.e., is not present on the majority of other cells. Many tumor markers exist in addition to folate binding protein described herein, and any of these may be suitable for targeting in the context of the present invention. Common tumor markers include carcinoembryonic antigen, prostate specific antigen, urinary tumor associated antigen, fetal antigen, tyrosinase (p97), gp68, TAG-72, HMFG, Sialyl Lewis Antigen, MucA, MucB, PLAP, estrogen receptor, laminin receptor, erb B and p155.

The disclosures presented herein have significant relevance to immunotherapy of human diseases and disorders, including cancer. In using the immunotherapeutic compositions derived from the antigen of the present invention in treatment methods, other standard treatments also may be employed, such as radiotherapy or chemotherapy. However, it is preferred that the immunotherapy be used alone initially as its effectiveness can be readily assessed. Immunotherapies of cancer can broadly be classified as adoptive, passive and active, as described in the following sections, and may be used or produced with the folate binding protein variant antigen of the present invention.

A. Immune Stimulators

A specific aspect of immunotherapy is to use an immune stimulating molecule as an agent, or more preferably in conjunction with another agent, such as, for example, a cytokine such as IL-2, IL-4, IL-12, GM-CSF, tumor necrosis factor; interferons alpha, beta, and gamma; F42K and other cytokine analogs; a chemokine such as, for example, MIP-1, MIP-1beta, MCP-1, RANTES, IL-8; or a growth factor such as, for example, FLT3 ligand.

One particular cytokine contemplated for use in the present invention is tumor necrosis factor. Tumor necrosis factor (TNF; Cachectin) is a glycoprotein that kills some kinds of cancer cells, activates cytokine production, activates macrophages and endothelial cells, promotes the production of collagen and collagenases, is an inflammatory mediator and also a mediator of septic shock, and promotes catabolism, fever and sleep. Some infectious agents cause tumor regression through the stimulation of TNF production. TNF can be quite toxic when used alone in effective doses, so that the optimal regimens probably will use it in lower doses in combination with other drugs. Its immunosuppressive actions are potentiated by gamma-interferon, so that the combination potentially is dangerous. A hybrid of TNF and interferon-a also has been found to possess anti-cancer activity.

Another cytokine specifically contemplate is interferon alpha. Interferon alpha has been used in treatment of hairy cell leukemia, Kaposi's sarcoma, melanoma, carcinoid, renal cell cancer, ovary cancer, bladder cancer, non-Hodgkin's lymphomas, mycosis fungoides, multiple myeloma, and chronic granulocytic leukemia.

B. Passive Immunotherapy

A number of different approaches for passive immunotherapy of cancer exist. They may be broadly categorized into the following: injection of vaccine alone; injection of vaccine coupled to toxins or chemotherapeutic agents; injection of vaccine coupled to radioactive isotopes; injection of anti-idiotype vaccine; and finally, purging of tumor cells in bone marrow.

It may be favorable to administer more than one vaccine associated with two different antigens or even vaccine with multiple antigen specificity. Treatment protocols also may include administration of lymphokines or other immune enhancers (Bajorin et al. 1988).

C. Active Immunotherapy

In some embodiments of the invention, active immunotherapy may be employed. In active immunotherapy, a folate binding protein variant (e.g., a peptide or polypeptide), a nucleic acid encoding a folate binding protein variant, and/or additional vaccine components, such as for example, a cell expressing the folate binding protein variant (e.g. a dendritic cell fused with a tumor cell, or an autologous or allogeneic tumor cell composition expressing the antigen), an adjuvant, a recombinant protein, an immunomodulator, and the like is administered (Ravindranath and Morton, 1991; Morton and Ravindranath, 1996; Morton et al., 1992; Okamoto et al., 1997; Kugler et al., 2000; Trefzer et al., 2000; Mitchell et al., 1990; Mitchell et al., 1993).

An antigenic peptide, polypeptide or protein, or an autologous or allogenic tumor cell composition or “vaccine” is administered generally with a distinct bacterial adjuvant (Ravindranath and Morton, 1991; Morton and Ravindranath, 1996; Morton et al., 1992; Mitchell et al., 1990; Mitchell et al., 1993). In melanoma immunotherapy, those patients who elicit high IgM response often survive better than those who elicit no or low IgM antibodies (Morton et al., 1992). IgM antibodies are often transient antibodies and the exception to the rule appears to be anti-ganglioside or anti-carbohydrate antibodies.

D. Adoptive Immunotherapy

In adoptive immunotherapy, the patient's circulating lymphocytes, or tumor infiltrated lymphocytes, are isolated in vitro, activated by lymphokines such as IL-2 or transduced with genes for tumor necrosis, and readministered (Rosenberg et al., 1988; 1989). To achieve this, one would administer to an animal, or human patient, an immunologically effective amount of activated lymphocytes in combination with an adjuvant-incorporated antigenic peptide composition as described herein. The activated lymphocytes will most preferably be the patient's own cells that were earlier isolated from a blood or tumor sample and activated (or “expanded”) in vitro. In certain embodiments, the patient's lymphocytes are cultured or expanded in number or selected for activity, such as immunoreactivity to the antigen. This form of immunotherapy has produced several cases of regression of melanoma and renal carcinoma.

VI. Vaccines

The present invention contemplates vaccines for use in both active and passive immunization embodiments. Immunogenic compositions, proposed to be suitable for use as a vaccine, may be prepared most readily directly from immunogenic CTL-stimulating peptides prepared in a manner disclosed herein. Preferably the antigenic material is extensively dialyzed to remove undesired small molecular weight molecules and/or lyophilized for more ready formulation into a desired vehicle.

The preparation of vaccines which contain peptide sequences as active ingredients is generally well understood in the art, as exemplified by U.S. Pat. Nos. 4,608,251; 4,601,903; 4,599,231; 4,599,230; 4,596,792; and 4,578,770, all incorporated herein by reference. Typically, such vaccines are prepared as injectables. Either as liquid solutions or suspensions: solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The preparation may also be emulsified. The active immunogenic ingredient is often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the effectiveness of the vaccines.

Vaccines may be conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and carriers may include, for example, polyalkalene glycols or triglycerides: such suppositories may be formed from mixtures containing the active ingredient in the range of about 0.5% to about 10%, preferably about 1 to about 2%. Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or powders and contain about 10 to about 95% of active ingredient, preferably about 25 to about 70%.

The peptides of the present invention may be formulated into the vaccine as neutral or salt forms. Pharmaceutically-acceptable salts, include the acid addition salts (formed with the free amino groups of the peptide) and those which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups may also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.

The vaccines are administered in a manner compatible with the dosage formulation, and in such amount as will be therapeutically effective and immunogenic. The quantity to be administered depends on the subject to be treated, including, e.g., the capacity of the individual's immune system to synthesize antibodies, and the degree of protection desired. Precise amounts of active ingredient required to be administered depend on the judgment of the practitioner. However, suitable dosage ranges are of the order of several hundred micrograms active ingredient per vaccination. Suitable regimes for initial administration and booster shots are also variable, but are typified by an initial administration followed by subsequent inoculations or other administrations.

The manner of application may be varied widely. Any of the conventional methods for administration of a vaccine are applicable. These are believed to include oral application on a solid physiologically acceptable base or in a physiologically acceptable dispersion, parenterally, by injection or the like. The dosage of the vaccine will depend on the route of administration and will vary according to the size of the host.

Various methods of achieving adjuvant effect for the vaccine includes use of agents such as aluminum hydroxide or phosphate (alum), commonly used as about 0.05 to about 0.1% solution in phosphate buffered saline, admixture with synthetic polymers of sugars (Carbopol®) used as an about 0.25% solution, aggregation of the protein in the vaccine by heat treatment with temperatures ranging between about 70° to about 101° C. for a 30-second to 2-minute period, respectively. Aggregation by reactivating with pepsin treated (Fab) antibodies to albumin, mixture with bacterial cells such as C. parvum or endotoxins or lipopolysaccharide components of Gram-negative bacteria, emulsion in physiologically acceptable oil vehicles such as mannide mono-oleate (Aracel A) or emulsion with a 20% solution of a perfluorocarbon (Fluosol-DA®) used as a block substitute may also be employed.

In many instances, it will be desirable to have multiple administrations of the vaccine, usually not exceeding six vaccinations, more usually not exceeding four vaccinations and preferably one or more, usually at least about three vaccinations. The vaccinations will normally be at from two to twelve week intervals, more usually from three to five week intervals. Periodic boosters at intervals of 1-5 years, usually three years, will be desirable to maintain protective levels of the antibodies. The course of the immunization may be followed by assays for antibodies for the supernatant antigens. The assays may be performed by labeling with conventional labels, such as radionuclides, enzymes, fluorescents, and the like. These techniques are well known and may be found in a wide variety of patents, such as U.S. Pat. Nos. 3,791,932; 4,174,384 and 3,949,064, as illustrative of these types of assays.

For an antigenic composition to be useful as a vaccine, an antigenic composition must induce an immune response to the antigen in a cell, tissue or animal (e.g., a human). As used herein, an “antigenic composition” may comprise an antigen (e.g., a peptide or polypeptide), a nucleic acid encoding an antigen (e.g., an antigen expression vector), or a cell expressing or presenting an antigen. In particular embodiments, the antigenic composition comprises or encodes a folate binding protein variant, or an immunologically functional equivalent thereof. In other embodiments, the antigenic composition is in a mixture that comprises an additional immunostimulatory agent or nucleic acids encoding such an agent. Immunostimulatory agents include but are not limited to an additional antigen, an immunomodulator, an antigen presenting cell or an adjuvant. In other embodiments, one or more of the additional agent(s) is covalently bonded to the antigen or an immunostimulatory agent, in any combination. In certain embodiments, the antigenic composition is conjugated to or comprises an HLA anchor motif amino acids.

In certain embodiments, an antigenic composition or immunologically functional equivalent, may be used as an effective vaccine in inducing an anti-folate binding protein variant humoral and/or cell-mediated immune response in an animal. The present invention contemplates one or more antigenic compositions or vaccines for use in both active and passive immunization embodiments.

A vaccine of the present invention may vary in its composition of proteinaceous, nucleic acid and/or cellular components. In a non-limiting example, a nucleic acid encoding an antigen might also be formulated with a proteinaceous adjuvant. Of course, it will be understood that various compositions described herein may further comprise additional components. For example, one or more vaccine components may be comprised in a lipid or liposome. In another non-limiting example, a vaccine may comprise one or more adjuvants. A vaccine of the present invention, and its various components, may be prepared and/or administered by any method disclosed herein or as would be known to one of ordinary skill in the art, in light of the present disclosure.

A. Proteinaceous Antigens

It is understood that an antigenic composition of the present invention may be made by a method that is well known in the art, including but not limited to chemical synthesis by solid phase synthesis and purification away from the other products of the chemical reactions by HPLC, or production by the expression of a nucleic acid sequence (e.g., a DNA sequence) encoding a peptide or polypeptide comprising an antigen of the present invention in an in vitro translation system or in a living cell. Preferably the antigenic composition is isolated and extensively dialyzed to remove one or more undesired small molecular weight molecules and/or lyophilized for more ready formulation into a desired vehicle. It is further understood that additional amino acids, mutations, chemical modification and the like, if any, that are made in a vaccine component will preferably not substantially interfere with the antibody recognition of the epitopic sequence.

A peptide or polypeptide corresponding to one or more antigenic determinants of the folate binding protein variant of the present invention should generally be at least five or six amino acid residues in length, and may contain up to about 10, about 15, about 20, or more. A peptide sequence may be synthesized by methods known to those of ordinary skill in the art, for example, peptide synthesis using automated peptide synthesis machines, such as those available from Applied Biosystems (Foster City, Calif.).

Longer peptides or polypeptides also may be prepared, e.g., by recombinant means. In certain embodiments, a nucleic acid encoding an antigenic composition and/or a component described herein may be used, for example, to produce an antigenic composition in vitro or in vivo for the various compositions and methods of the present invention. For example, in certain embodiments, a nucleic acid encoding an antigen is comprised in, for example, a vector in a recombinant cell. The nucleic acid may be expressed to produce a peptide or polypeptide comprising an antigenic sequence. The peptide or polypeptide may be secreted from the cell, or comprised as part of or within the cell.

B. Genetic Vaccine Antigens

In certain embodiments, an immune response may be promoted by transfecting or inoculating an animal with a nucleic acid encoding an antigen. One or more cells comprised within a target animal then expresses the sequences encoded by the nucleic acid after administration of the nucleic acid to the animal. Thus, the vaccine may comprise “genetic vaccine” useful for immunization protocols. A vaccine may also be in the form, for example, of a nucleic acid (e.g., a cDNA or an RNA) encoding all or part of the peptide or polypeptide sequence of an antigen. Expression in vivo by the nucleic acid may be, for example, by a plasmid type vector, a viral vector, or a viral/plasmid construct vector.

In preferred aspects, the nucleic acid comprises a coding region that encodes all or part of the sequences disclosed as SEQ ID NO:1 through SEQ ID NO:9, or an immunologically functional equivalent thereof. Of course, the nucleic acid may comprise and/or encode additional sequences, including but not limited to those comprising one or more immunomodulators or adjuvants. The nucleotide and protein, polypeptide and peptide encoding sequences for various genes have been previously disclosed, and may be found at computerized databases known to those of ordinary skill in the art. One such database is the National Center for Biotechnology Information's Genbank® and GenPept databases. The coding regions for these known genes may be amplified, combined with the nucleic acid sequences encoding the folate binding protein variant disclosed herein (e.g, ligated) and/or expressed using the techniques disclosed herein or by any technique that would be know to those of ordinary skill in the art (e.g, Sambrook et al., 1987). Though a nucleic acid may be expressed in an in vitro expression system, in preferred embodiments the nucleic acid comprises a vector for in vivo replication and/or expression.

C. Cellular Vaccine Antigens

In another embodiment, a cell expressing the antigen may comprise the vaccine. The cell may be isolated from a culture, tissue, organ or organism and administered to an animal as a cellular vaccine. Thus, the present invention contemplates a “cellular vaccine.” The cell may be transfected with a nucleic acid encoding an antigen to enhance its expression of the antigen. Of course, the cell may also express one or more additional vaccine components, such as immunomodulators or adjuvants. A vaccine may comprise all or part of the cell.

D. Immunologically Functional Equivalents

Modification and changes may be made in the structure of the peptides of the present invention and DNA segments which encode them and still obtain a functional molecule that encodes a protein or peptide with desirable characteristics. The following is a discussion based upon changing the amino acids of a protein to create an equivalent, or even an improved, second-generation molecule. The amino acid changes may be achieved by changing the codons of the DNA sequence, according to the following codon table:

TABLE 1 Amino Acids Codons Alanine Ala A GCA GCC GCG GCU Cysteine Cys C UGC UGU Aspartic acid Asp D GAC GAU Glutamic acid Glu E GAA GAG Phenylalanine Phe F UUC UUU Glycine Gly G GGA GGC GGG GGU Histidine His H CAC CAU Isoleucine Ile I AUA AUC AUU Lysine Lys K AAA AAG Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUG Asparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gln Q CAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCA UCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUU Tryptophan Trp W UGG Tyrosine Tyr Y UAC UAU

For example, certain amino acids may be substituted for other amino acids in a protein structure without appreciable loss of interactive binding capacity with structures such as, for example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it is the interactive capacity and nature of a protein that defines that protein's biological functional activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of course, its underlying DNA coding sequence, and nevertheless obtain a protein with like properties. It is thus contemplated by the inventors that various changes may be made in the peptide sequences of the disclosed compositions, or corresponding DNA sequences which encode the peptides without appreciable loss of their biological utility or activity. Amino acid substitutions may be based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of the foregoing characteristics into consideration are well known to those of skill in the art.

Numerous scientific publications have also been devoted to the prediction of secondary structure, and to the identification of an epitope, from analyses of an amino acid sequence (Chou and Fasman, 1974a,b; 1978a,b, 1979). Any of these may be used, if desired, to supplement the teachings of U.S. Pat. No. 4,554,101.

Moreover, computer programs are currently available to assist with predicting an antigenic portion and an epitopic core region of one or more proteins, polypeptides or peptides. Examples include those programs based upon the Jameson-Wolf analysis (Jameson & Wolf, 1988; Wolf et al., 1988), the program PepPlot® (Brutlag et al., 1990; Weinberger et al., 1985), and other new programs for protein tertiary structure prediction (Fetrow & Bryant, 1993). Another commercially available software program capable of carrying out such analyses is MacVector® (IBI, New Haven, Conn.).

As modifications and changes may be made in the structure of an antigenic composition (e.g., a folate binding protein variant) of the present invention, and still obtain molecules having like or otherwise desirable characteristics, such immunologically functional equivalents are also encompassed within the present invention.

For example, certain amino acids may be substituted for other amino acids in a peptide, polypeptide or protein structure without appreciable loss of interactive binding capacity with structures such as, for example, antigen-binding regions of antibodies, binding sites on substrate molecules or receptors, DNA binding sites, or such like. Since it is the interactive capacity and nature of a peptide, polypeptide or protein that defines its biological (e.g., immunological) functional activity, certain amino acid sequence substitutions can be made in a amino acid sequence (or, of course, its underlying DNA coding sequence) and nevertheless obtain a peptide or polypeptide with like (agonistic) properties. It is thus contemplated by the inventors that various changes may be made in the sequence of an antigenic composition such as, for example a folate binding protein variant peptide or polypeptide, or underlying DNA, without appreciable loss of biological utility or activity.

Accordingly, antigenic composition, particularly an immunologically functional equivalent of the sequences disclosed herein, may encompass an amino molecule sequence comprising at least one of the 20 common amino acids in naturally synthesized proteins, or at least one modified or unnatural amino acid, including but not limited to those shown on Table 2 below.

TABLE 2 Modified, Unnatural or Rare Amino Acids Abbr. Amino Acid Abbr. Amino Acid Aad 2-Aminoadipic acid EtAsn N-Ethylasparagine Baad 3-Aminoadipic acid Hyl Hydroxylysine Bala β-alanine, b-Amino- Ahyl Allo-Hydroxylysine propionic acid Abu 2-Aminobutyric acid 3Hyp 3-Hydroxyproline 4Abu 4-Aminobutyric acid, 4Hyp 4-Hydroxyproline piperidinic acid Acp 6-Aminocaproic acid Ide Isodesmosine Ahe 2-Aminoheptanoic acid Aile Allo-Isoleucine Aib 2-Aminoisobutyric acid MeGly N-Methylglycine, sarcosine Baib 3-Aminoisobutyric acid MeIle N-Methylisoleucine Apm 2-Aminopimelic acid MeLys 6-N-Methyllysine Dbu 2,4-Diaminobutyric acid MeVal N-Methylvaline Des Desmosine Nva Norvaline Dpm 2,2′-Diaminopimelic acid Nle Norleucine Dpr 2,3-Diaminopropionic acid Orn Ornithine EtGly N-Ethylglycine

In terms of immunologically functional equivalent, it is well understood by the skilled artisan that, inherent in the definition is the concept that there is a limit to the number of changes that may be made within a defined portion of the molecule and still result in a molecule with an acceptable level of equivalent immunological activity. An immunologically functional equivalent peptide or polypeptide are thus defined herein as those peptide(s) or polypeptide(s) in which certain, not most or all, of the amino acid(s) may be substituted.

In particular, where a shorter length peptide is concerned, it is contemplated that fewer amino acid substitutions should be made within the given peptide. A longer polypeptide may have an intermediate number of changes. The full-length protein will have the most tolerance for a larger number of changes. Of course, a plurality of distinct polypeptides/peptides with different substitutions may easily be made and used in accordance with the invention.

It also is well understood that where certain residues are shown to be particularly important to the immunological or structural properties of a protein or peptide, e.g., residues in binding regions or active sites, such residues may not generally be exchanged. This is an important consideration in the present invention, where changes in the folate binding protein variant antigenic site should be carefully considered and subsequently tested to ensure maintenance of immunological function (e.g., antigenicity), where maintenance of immunological function is desired. In this manner, functional equivalents are defined herein as those peptides or polypeptides which maintain a substantial amount of their native immunological activity.

Amino acid substitutions are generally based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and the like. An analysis of the size, shape and type of the amino acid side-chain substituents reveals that arginine, lysine and histidine are all positively charged residues; that alanine, glycine and serine are all a similar size; and that phenylalanine, tryptophan and tyrosine all have a generally similar shape. Careful selection of a particular amino acid substitution for a peptide, as opposed to a protein, must be considered given the differences in size between peptides and proteins.

In further embodiments, major antigenic determinants of a peptide or polypeptide may be identified by an empirical approach in which portions of a nucleic acid encoding a peptide or polypeptide are expressed in a recombinant host, and the resulting peptide(s) or polypeptide(s) tested for their ability to elicit an immune response. For example, PCR™ can be used to prepare a range of peptides or polypeptides lacking successively longer fragments of the C-terminus of the amino acid sequence. The immunoactivity of each of these peptides or polypeptides is determined to identify those fragments or domains that are immunodominant. Further studies in which only a small number of amino acids are removed at each iteration then allows the location of the antigenic determinant(s) of the peptide or polypeptide to be more precisely determined.

Another method for determining a major antigenic determinant of a peptide or polypeptide is the SPOTs™ system (Genosys Biotechnologies, Inc., The Woodlands, Tex.). In this method, overlapping peptides are synthesized on a cellulose membrane, which following synthesis and deprotection, is screened using a polyclonal or monoclonal antibody. An antigenic determinant of the peptides or polypeptides which are initially identified can be further localized by performing subsequent syntheses of smaller peptides with larger overlaps, and by eventually replacing individual amino acids at each position along the immunoreactive sequence.

Once one or more such analyses are completed, an antigenic composition, such as for example a peptide or a polypeptide is prepared that contain at least the essential features of one or more antigenic determinants. An antigenic composition is then employed in the generation of antisera against the composition, and preferably the antigenic determinant(s).

While discussion has focused on functionally equivalent polypeptides arising from amino acid changes, it will be appreciated that these changes may be effected by alteration of the encoding DNA; taking into consideration also that the genetic code is degenerate and that two or more codons may code for the same amino acid. Nucleic acids encoding these antigenic compositions also can be constructed and inserted into one or more expression vectors by standard methods (Sambrook et al., 1987), for example, using PCR™ cloning methodology.

In addition to the peptidyl compounds described herein, the inventors also contemplate that other sterically similar compounds may be formulated to mimic the key portions of the peptide or polypeptide structure or to interact specifically with, for example, an antibody. Such compounds, which may be termed peptidomimetics, may be used in the same manner as a peptide or polypeptide of the invention and hence are also immunologically functional equivalents.

Certain mimetics that mimic elements of protein secondary structure are described in Johnson et al. (1993). The underlying rationale behind the use of peptide mimetics is that the peptide backbone of proteins exists chiefly to orientate amino acid side chains in such a way as to facilitate molecular interactions, such as those of antibody and antigen. A peptide mimetic is thus designed to permit molecular interactions similar to the natural molecule.

E. Antigen Mutagenesis

In particular embodiments, an antigenic composition is mutated for purposes such as, for example, enhancing its immunogenicity or producing or identifying an immunologically functional equivalent sequence. Methods of mutagenesis are well known to those of skill in the art (Sambrook et al., 1987).

As used herein, the term “oligonucleotide directed mutagenesis procedure” refers to template-dependent processes and vector-mediated propagation which result in an increase in the concentration of a specific nucleic acid molecule relative to its initial concentration, or in an increase in the concentration of a detectable signal, such as amplification. As used herein, the term “oligonucleotide directed mutagenesis procedure” is intended to refer to a process that involves the template-dependent extension of a primer molecule. The term template dependent process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the sequence of the newly synthesized strand of nucleic acid is dictated by the well-known rules of complementary base pairing (see, for example, Watson, 1987). Typically, vector mediated methodologies involve the introduction of the nucleic acid fragment into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of the amplified nucleic acid fragment. Examples of such methodologies are provided by U.S. Pat. No. 4,237,224, specifically incorporated herein by reference in its entirety.

In a preferred embodiment, site directed mutagenesis is used. Site-specific mutagenesis is a technique useful in the preparation of an antigenic composition (e.g., a folate binding protein variant-comprising peptide or polypeptide, or immunologically functional equivalent protein, polypeptide or peptide), through specific mutagenesis of the underlying DNA. In general, the technique of site-specific mutagenesis is well known in the art. The technique further provides a ready ability to prepare and test sequence variants, incorporating one or more of the foregoing considerations, by introducing one or more nucleotide sequence changes into the DNA. Site-specific mutagenesis allows the production of a mutant through the use of specific oligonucleotide sequence(s) which encode the DNA sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the position being mutated. Typically, a primer of about 17 to about 75 nucleotides in length is preferred, with about 10 to about 25 or more residues on both sides of the position being altered, while primers of about 17 to about 25 nucleotides in length being more preferred, with about 5 to 10 residues on both sides of the position being altered.

In general, site-directed mutagenesis is performed by first obtaining a single-stranded vector, or melting of two strands of a double stranded vector which includes within its sequence a DNA sequence encoding the desired protein. As will be appreciated by one of ordinary skill in the art, the technique typically employs a bacteriophage vector that exists in both a single stranded and double stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as the M13 phage. These phage vectors are commercially available and their use is generally well known to those skilled in the art. Double stranded plasmids are also routinely employed in site directed mutagenesis, which eliminates the step of transferring the gene of interest from a phage to a plasmid.

This mutagenic primer is then annealed with the single-stranded DNA preparation, and subjected to DNA polymerizing enzymes such as, for example, E. coli polymerase I Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated sequence and the second strand bears the desired mutation. This heteroduplex vector is then used to transform appropriate cells, such as E. coli cells, and clones are selected that include recombinant vectors bearing the mutated sequence arrangement.

Alternatively, a pair of primers may be annealed to two separate strands of a double stranded vector to simultaneously synthesize both corresponding complementary strands with the desired mutation(s) in a PCR™ reaction. A genetic selection scheme to enrich for clones incorporating the mutagenic oligonucleotide has been devised (Kunkel et al., 1987). Alternatively, the use of PCR™ with commercially available thermostable enzymes such as Taq polymerase may be used to incorporate a mutagenic oligonucleotide primer into an amplified DNA fragment that can then be cloned into an appropriate cloning or expression vector (Tomic et al., 1990; Upender et al., 1995). A PCR™ employing a thermostable ligase in addition to a thermostable polymerase also may be used to incorporate a phosphorylated mutagenic oligonucleotide into an amplified DNA fragment that may then be cloned into an appropriate cloning or expression vector (Michael 1994).

The preparation of sequence variants of the selected gene using site-directed mutagenesis is provided as a means of producing potentially useful species and is not meant to be limiting, as there are other ways in which sequence variants of genes may be obtained. For example, recombinant vectors encoding the desired gene may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence variants.

Additionally, one particularly useful mutagenesis technique is alanine scanning mutagenesis in which a number of residues are substituted individually with the amino acid alanine so that the effects of losing side-chain interactions can be determined, while minimizing the risk of large-scale perturbations in protein conformation (Cunningham et al., 1989).

F. Vectors

In order to effect replication, expression or mutagenesis of a nucleic acid, the nucleic acid may be delivered (“transfected”) into a cell. The tranfection of cells may be used, in certain embodiments, to recombinately produce one or more vaccine components for subsequent purification and preparation into a pharmaceutical vaccine. In other embodiments, the nucleic acid may be comprised as a genetic vaccine that is administered to an animal. In other embodiments, the nucleic acid is transfected into a cell and the cell administered to an animal as a cellular vaccine component. The nucleic acid may consist only of naked recombinant DNA, or may comprise, for example, additional materials to protect the nucleic acid and/or aid its targeting to specific cell types.

The term “vector” is used to refer to a carrier nucleic acid molecule into which a nucleic acid sequence can be inserted for introduction into a cell where it can be replicated. A nucleic acid sequence can be “exogenous,” which means that it is foreign to the cell into which the vector is being introduced or that the sequence is homologous to a sequence in the cell but in a position within the host cell nucleic acid in which the sequence is ordinarily not found. Vectors include plasmids, cosmids, viruses (bacteriophage, animal viruses, and plant viruses), and artificial chromosomes (e.g., YACs). One of skill in the art would be well equipped to construct a vector through standard recombinant techniques (see, for example, Maniatis et al., 1988 and Ausubel et al., 1994, both incorporated herein by reference).

The term “expression vector” refers to any type of genetic construct comprising a nucleic acid coding for a RNA capable of being transcribed. In some cases, RNA molecules are then translated into a protein, polypeptide, or peptide. In other cases, these sequences are not translated, for example, in the production of antisense molecules or ribozymes. Expression vectors can contain a variety of “control sequences,” which refer to nucleic acid sequences necessary for the transcription and possibly translation of an operably linked coding sequence in a particular host cell.

The nucleic acid encoding the antigenic composition or other vaccine component may be stably integrated into the genome of the cell, or may be stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments or “episomes” encode sequences sufficient to permit maintenance and replication independent of or in synchronization with the host cell cycle. Vectors and expression vectors may contain nucleic acid sequences that serve other functions as well and are described infra. How the expression construct is delivered to a cell and where in the cell the nucleic acid remains is dependent on the type of expression construct employed.

1. Promoters and Enhancers

A “promoter” is a control sequence that is a region of a nucleic acid sequence at which initiation and rate of transcription are controlled. It may contain genetic elements at which regulatory proteins and molecules may bind, such as RNA polymerase and other transcription factors, to initiate the specific transcription a nucleic acid sequence. The phrases “operatively positioned,” “operatively linked,” “under control,” and “under transcriptional control” mean that a promoter is in a correct functional location and/or orientation in relation to a nucleic acid sequence to control transcriptional initiation and/or expression of that sequence.

A promoter generally comprises a sequence that functions to position the start site for RNA synthesis. The best known example of this is the TATA box, but in some promoters lacking a TATA box, such as, for example, the promoter for the mammalian terminal deoxynucleotidyl transferase gene and the promoter for the SV40 late genes, a discrete element overlying the start site itself helps to fix the place of initiation. Additional promoter elements regulate the frequency of transcriptional initiation. Typically, these are located in the region 30-110 bp upstream of the start site, although a number of promoters have been shown to contain functional elements downstream of the start site as well. To bring a coding sequence “under the control of” a promoter, one positions the 5′ end of the transcription initiation site of the transcriptional reading frame “downstream” of (i.e., 3′ of) the chosen promoter. The “upstream” promoter stimulates transcription of the DNA and promotes expression of the encoded RNA.

The spacing between promoter elements frequently is flexible, so that promoter function is preserved when elements are inverted or moved relative to one another. In the tk promoter, the spacing between promoter elements can be increased to 50 bp apart before activity begins to decline. Depending on the promoter, it appears that individual elements can function either cooperatively or independently to activate transcription. A promoter may or may not be used in conjunction with an “enhancer,” which refers to a cis-acting regulatory sequence involved in the transcriptional activation of a nucleic acid sequence.

A promoter may be one naturally associated with a nucleic acid sequence, as may be obtained by isolating the 5′ non-coding sequences located upstream of the coding segment and/or exon. Such a promoter can be referred to as “endogenous.” Similarly, an enhancer may be one naturally associated with a nucleic acid sequence, located either downstream or upstream of that sequence. Alternatively, certain advantages will be gained by positioning the coding nucleic acid segment under the control of a recombinant or heterologous promoter, which refers to a promoter that is not normally associated with a nucleic acid sequence in its natural environment. A recombinant or heterologous enhancer refers also to an enhancer not normally associated with a nucleic acid sequence in its natural environment. Such promoters or enhancers may include promoters or enhancers of other genes, and promoters or enhancers isolated from any other virus, or prokaryotic or eukaryotic cell, and promoters or enhancers not “naturally occurring,” i.e., containing different elements of different transcriptional regulatory regions, and/or mutations that alter expression. For example, promoters that are most commonly used in recombinant DNA construction include the β-lactamase (penicillinase), lactose and tryptophan (trp) promoter systems. In addition to producing nucleic acid sequences of promoters and enhancers synthetically, sequences may be produced using recombinant cloning and/or nucleic acid amplification technology, including PCR™, in connection with the compositions disclosed herein (see U.S. Pat. Nos. 4,683,202 and 5,928,906, each incorporated herein by reference). Furthermore, it is contemplated the control sequences that direct transcription and/or expression of sequences within non-nuclear organelles such as mitochondria, chloroplasts, and the like, can be employed as well.

Naturally, it will be important to employ a promoter and/or enhancer that effectively directs the expression of the DNA segment in the organelle, cell type, tissue, organ, or organism chosen for expression. Those of skill in the art of molecular biology generally know the use of promoters, enhancers, and cell type combinations for protein expression, (see, for example Sambrook et al. 1989, incorporated herein by reference). The promoters employed may be constitutive, tissue-specific, inducible, and/or useful under the appropriate conditions to direct high level expression of the introduced DNA segment, such as is advantageous in the large-scale production of recombinant proteins and/or peptides. The promoter may be heterologous or endogenous.

Additionally any promoter/enhancer combination (as per, for example, the Eukaryotic Promoter Data Base EPDB could also be used to drive expression. Use of a T3, T7 or SP6 cytoplasmic expression system is another possible embodiment. Eukaryotic cells can support cytoplasmic transcription from certain bacterial promoters if the appropriate bacterial polymerase is provided, either as part of the delivery complex or as an additional genetic expression construct.

Table 3 lists non-limiting examples of elements/promoters that may be employed, in the context of the present invention, to regulate the expression of a RNA. Table 4 provides non-limiting examples of inducible elements, which are regions of a nucleic acid sequence that can be activated in response to a specific stimulus.

TABLE 3 Promoter and/or Enhancer Promoter/Enhancer References Immunoglobulin Heavy Chain Banerji et al., 1983; Gilles et al., 1983; Grosschedl et al., 1985; Atchinson et al., 1986, 1987; Imler et al., 1987; Weinberger et al., 1984; Kiledjian et al., 1988; Porton et al.; 1990 Immunoglobulin Light Chain Queen et al., 1983; Picard et al., 1984 T-Cell Receptor Luria et al., 1987; Winoto et al., 1989; Redondo et al.; 1990 HLA DQ a and/or DQ β Sullivan et al., 1987 β-Interferon Goodbourn et al., 1986; Fujita et al., 1987; Goodbourn et al., 1988 Interleukin-2 Greene et al., 1989 Interleukin-2 Receptor Greene et al., 1989; Lin et al., 1990 MHC Class II 5 Koch et al., 1989 MHC Class II HLA-DRa Sherman et al., 1989 β-Actin Kawamoto et al., 1988; Ng et al.; 1989 Muscle Creatine Kinase (MCK) Jaynes et al., 1988; Horlick et al., 1989; Johnson et al., 1989 Prealbumin (Transthyretin) Costa et al., 1988 Elastase I Omitz et al., 1987 Metallothionein (MTII) Karin et al., 1987; Culotta et al., 1989 Collagenase Pinkert et al., 1987; Angel et al., 1987 Albumin Pinkert et al., 1987; Tronche et al., 1989, 1990 α-Fetoprotein Godbout et al., 1988; Campere et al., 1989 t-Globin Bodine et al., 1987; Perez-Stable et al., 1990 β-Globin Trudel et al., 1987 c-fos Cohen et al., 1987 c-HA-ras Triesman, 1986; Deschamps et al., 1985 Insulin Edlund et al., 1985 Neural Cell Adhesion Molecule Hirsh et al., 1990 (NCAM) α₁-Antitrypain Latimer et al., 1990 H2B (TH2B) Histone Hwang et al., 1990 Mouse and/or Type I Collagen Ripe et al., 1989 Glucose-Regulated Proteins Chang et al., 1989 (GRP94 and GRP78) Rat Growth Hormone Larsen et al., 1986 Human Serum Amyloid A (SAA) Edbrooke et al., 1989 Troponin I (TN I) Yutzey et al., 1989 Platelet-Derived Growth Factor Pech et al., 1989 (PDGF) Duchenne Muscular Dystrophy Klamut et al., 1990 SV40 Banerji et al., 1981; Moreau et al., 1981; Sleigh et al., 1985; Firak et al., 1986; Herr et al., 1986; Imbra et al., 1986; Kadesch et al., 1986; Wang et al., 1986; Ondek et al., 1987; Kuhl et al., 1987; Schaffner et al., 1988 Polyoma Swartzendruber et al., 1975; Vasseur et al., 1980; Katinka et al., 1980, 1981; Tyndell et al., 1981; Dandolo et al., 1983; de Villiers et al., 1984; Hen et al., 1986; Satake et al., 1988; Campbell and/or Villarreal, 1988 Retroviruses Kriegler et al., 1982, 1983; Levinson et al., 1982; Kriegler et al., 1983, 1984a, b, 1988; Bosze et al., 1986; Miksicek et al., 1986; Celander et al., 1987; Thiesen et al., 1988; Celander et al., 1988; Chol et al., 1988; Reisman et al., 1989 Papilloma Virus Campo et al., 1983; Lusky et al., 1983; Spandidos and/or Wilkie, 1983; Spalholz et al., 1985; Lusky et al., 1986; Cripe et al., 1987; Gloss et al., 1987; Hirochika et al., 1987; Stephens et al., 1987; Glue et al., 1988 Hepatitis B Virus Bulla et al., 1986; Jameel et al., 1986; Shaul et al., 1987; Spandau et al., 1988; Vannice et al., 1988 Human Immunodeficiency Virus Muesing et al., 1987; Hauber et al., 1988; Jakobovits et al., 1988; Feng et al., 1988; Takebe et al., 1988; Rosen et al., 1988; Berkhout et al., 1989; Laspia et al., 1989; Sharp et al., 1989; Braddock et al., 1989 Cytomegalovirus (CMV) Weber et al., 1984; Boshart et al., 1985; Foecking et al., 1986 Gibbon Ape Leukemia Virus Holbrook et al., 1987; Quinn et al., 1989

TABLE 4 Inducible Elements Element Inducer References MT II Phorbol Ester (TFA) Palmiter et al., 1982; Haslinger et Heavy metals al., 1985; Searle et al., 1985; Stuart et al., 1985; Imagawa et al., 1987, Karin et al., 1987; Angel et al., 1987b; McNeall et al., 1989 MMTV (mouse Glucocorticoids Huang et al., 1981; Lee et al., mammary tumor 1981; Majors et al., 1983; virus) Chandler et al., 1983; Lee et al., 1984; Ponta et al., 1985; Sakai et al., 1988 β-Interferon poly(rI)x Tavernier et al., 1983 poly(rc) Adenovirus 5 E2 ElA Imperiale et al., 1984 Collagenase Phorbol Ester (TPA) Angel et al., 1987a Stromelysin Phorbol Ester (TPA) Angel et al., 1987b SV40 Phorbol Ester (TPA) Angel et al., 1987b Murine MX Gene Interferon, Hug et al., 1988 Newcastle Disease Virus GRP78 Gene A23187 Resendez et al., 1988 α-2-Macroglobulin IL-6 Kunz et al., 1989 Vimentin Serum Rittling et al., 1989 MHC Class I Gene Interferon Blanar et al., 1989 H-2κb HSP70 ElA, SV40 Large T Taylor et al., 1989, 1990a, 1990b Antigen Proliferin Phorbol Ester-TPA Mordacq et al., 1989 Tumor Necrosis PMA Hensel et al., 1989 Factor Thyroid Thyroid Hormone Chatterjee et al., 1989 Stimulating Hormone α Gene

The identity of tissue-specific promoters or elements, as well as assays to characterize their activity, is well known to those of skill in the art. Nonlimiting examples of such regions include the human LIMK2 gene (Nomoto et al. 1999), the somatostatin receptor 2 gene (Kraus et al., 1998), murine epididymal retinoic acid-binding gene (Lareyre et al., 1999), human CD4 (Zhao-Emonet et al., 1998), mouse alpha2 (XI) collagen (Tsumaki, et al., 1998), DIA dopamine receptor gene (Lee, et al., 1997), insulin-like growth factor II (Wu et al., 1997), and human platelet endothelial cell adhesion molecule-1 (Almendro et al., 1996).

2. Initiation Signals and Internal Ribosome Binding Sites

A specific initiation signal also may be required for efficient translation of coding sequences. These signals include the ATG initiation codon or adjacent sequences. Exogenous translational control signals, including the ATG initiation codon, may need to be provided. One of ordinary skill in the art would readily be capable of determining this and providing the necessary signals. It is well known that the initiation codon must be “in-frame” with the reading frame of the desired coding sequence to ensure translation of the entire insert. The exogenous translational control signals and initiation codons can be either natural or synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements.

In certain embodiments of the invention, the use of internal ribosome entry sites (IRES) elements are used to create multigene, or polycistronic, messages. IRES elements are able to bypass the ribosome scanning model of 5′ methylated Cap dependent translation and begin translation at internal sites (Pelletier and Sonenberg, 1988). IRES elements from two members of the picornavirus family (polio and encephalomyocarditis) have been described (Pelletier and Sonenberg, 1988), as well an IRES from a mammalian message (Macejak and Sarnow, 1991). IRES elements can be linked to heterologous open reading frames. Multiple open reading frames can be transcribed together, each separated by an IRES, creating polycistronic messages. By virtue of the IRES element, each open reading frame is accessible to ribosomes for efficient translation. Multiple genes can be efficiently expressed using a single promoter/enhancer to transcribe a single message (see U.S. Pat. Nos. 5,925,565 and 5,935,819, each herein incorporated by reference).

3. Multiple Cloning Sites

Vectors can include a multiple cloning site (MCS), which is a nucleic acid region that contains multiple restriction enzyme sites, any of which can be used in conjunction with standard recombinant technology to digest the vector (see, for example, Carbonelli et al., 1999, Levenson et al., 1998, and Cocea, 1997, incorporated herein by reference.) “Restriction enzyme digestion” refers to catalytic cleavage of a nucleic acid molecule with an enzyme that functions only at specific locations in a nucleic acid molecule. Many of these restriction enzymes are commercially available. Use of such enzymes is widely understood by those of skill in the art. Frequently, a vector is linearized or fragmented using a restriction enzyme that cuts within the MCS to enable exogenous sequences to be ligated to the vector. “Ligation” refers to the process of forming phosphodiester bonds between two nucleic acid fragments, which may or may not be contiguous with each other. Techniques involving restriction enzymes and ligation reactions are well known to those of skill in the art of recombinant technology.

4. Splicing Sites

Most transcribed eukaryotic RNA molecules will undergo RNA splicing to remove introns from the primary transcripts. Vectors containing genomic eukaryotic sequences may require donor and/or acceptor splicing sites to ensure proper processing of the transcript for protein expression (see, for example, Chandler et al., 1997, herein incorporated by reference.)

5. Termination Signals

The vectors or constructs of the present invention will generally comprise at least one termination signal. A “termination signal” or “terminator” is comprised of the DNA sequences involved in specific termination of an RNA transcript by an RNA polymerase. Thus, in certain embodiments a termination signal that ends the production of an RNA transcript is contemplated. A terminator may be necessary in vivo to achieve desirable message levels.

In eukaryotic systems, the terminator region may also comprise specific DNA sequences that permit site-specific cleavage of the new transcript so as to expose a polyadenylation site. This signals a specialized endogenous polymerase to add a stretch of about 200 A residues (polyA) to the 3′ end of the transcript. RNA molecules modified with this polyA tail appear to more stable and are translated more efficiently. Thus, in other embodiments involving eukaryotes, it is preferred that that terminator comprises a signal for the cleavage of the RNA, and it is more preferred that the terminator signal promotes polyadenylation of the message. The terminator and/or polyadenylation site elements can serve to enhance message levels and to minimize read through from the cassette into other sequences.

Terminators contemplated for use in the invention include any known terminator of transcription described herein or known to one of ordinary skill in the art, including but not limited to, for example, the termination sequences of genes, such as for example the bovine growth hormone terminator or viral termination sequences, such as for example the SV40 terminator. In certain embodiments, the termination signal may be a lack of transcribable or translatable sequence, such as due to a sequence truncation.

6. Polyadenylation Signals

In expression, particularly eukaryotic expression, one will typically include a polyadenylation signal to effect proper polyadenylation of the transcript. The nature of the polyadenylation signal is not believed to be crucial to the successful practice of the invention, and any such sequence may be employed. Preferred embodiments include the SV40 polyadenylation signal or the bovine growth hormone polyadenylation signal, convenient and known to function well in various target cells. Polyadenylation may increase the stability of the transcript or may facilitate cytoplasmic transport.

7. Origins of Replication

In order to propagate a vector in a host cell, it may contain one or more origins of replication sites (often termed “ori”), which is a specific nucleic acid sequence at which replication is initiated. Alternatively an autonomously replicating sequence (ARS) can be employed if the host cell is yeast.

8. Selectable and Screenable Markers

In certain embodiments of the invention, cells containing a nucleic acid construct of the present invention may be identified in vitro or in vivo by including a marker in the expression vector. Such markers would confer an identifiable change to the cell permitting easy identification of cells containing the expression vector. Generally, a selectable marker is one that confers a property that allows for selection. A positive selectable marker is one in which the presence of the marker allows for its selection, while a negative selectable marker is one in which its presence prevents its selection. An example of a positive selectable marker is a drug resistance marker.

Usually the inclusion of a drug selection marker aids in the cloning and identification of transformants, for example, genes that confer resistance to neomycin, puromycin, hygromycin, DHFR, GPT, zeocin and histidinol are useful selectable markers. In addition to markers conferring a phenotype that allows for the discrimination of transformants based on the implementation of conditions, other types of markers including screenable markers such as GFP, whose basis is calorimetric analysis, are also contemplated. Alternatively, screenable enzymes such as herpes simplex virus thymidine kinase (tk) or chloramphenicol acetyltransferase (CAT) may be utilized. One of skill in the art would also know how to employ immunologic markers, possibly in conjunction with FACS analysis. The marker used is not believed to be important, so long as it is capable of being expressed simultaneously with the nucleic acid encoding a gene product. Further examples of selectable and screenable markers are well known to one of skill in the art.

9. Plasmid Vectors

In certain embodiments, a plasmid vector is contemplated for use to transform a host cell. In general, plasmid vectors containing replicon and control sequences which are derived from species compatible with the host cell are used in connection with these hosts. The vector ordinarily carries a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells. In a non-limiting example, E. coli is often transformed using derivatives of pBR322, a plasmid derived from an E. coli species. pBR322 contains genes for ampicillin and tetracycline resistance and thus provides easy means for identifying transformed cells. The pBR plasmid, or other microbial plasmid or phage must also contain, or be modified to contain, for example, promoters which can be used by the microbial organism for expression of its own proteins.

In addition, phage vectors containing replicon and control sequences that are compatible with the host microorganism can be used as transforming vectors in connection with these hosts. For example, the phage lambda GEMTMλ11 may be utilized in making a recombinant phage vector which can be used to transform host cells, such as, for example, E. coli LE392.

Further useful plasmid vectors include pIN vectors (Inouye et al., 1985); and pGEX vectors, for use in generating glutathione S-transferase (GST) soluble fusion proteins for later purification and separation or cleavage. Other suitable fusion proteins are those with β-galactosidase, ubiquitin, and the like.

Bacterial host cells, for example, E. coli, comprising the expression vector, are grown in any of a number of suitable media, for example, LB. The expression of the recombinant protein in certain vectors may be induced, as would be understood by those of skill in the art, by contacting a host cell with an agent specific for certain promoters, e.g., by adding IPTG to the media or by switching incubation to a higher temperature. After culturing the bacteria for a further period, generally of between 2 and 24 h, the cells are collected by centrifugation and washed to remove residual media.

10. Viral Vectors

The ability of certain viruses to infect cells or enter cells via receptor-mediated endocytosis, and to integrate into host cell genome and express viral genes stably and efficiently have made them attractive candidates for the transfer of foreign nucleic acids into cells (e.g., mammalian cells). Vaccine components of the present invention may be a viral vector that encode one or more folate binding protein variant antigenic compositions or other components such as, for example, a folate binding protein variant immunomodulator or adjuvant. Non-limiting examples of virus vectors that may be used to deliver a nucleic acid of the present invention are described below.

a. Adenoviral Vectors

A particular method for delivery of the nucleic acid involves the use of an adenovirus expression vector. Although adenovirus vectors are known to have a low capacity for integration into genomic DNA, this feature is counterbalanced by the high efficiency of gene transfer afforded by these vectors. “Adenovirus expression vector” is meant to include those constructs containing adenovirus sequences sufficient to (a) support packaging of the construct and (b) to ultimately express a tissue or cell-specific construct that has been cloned therein. Knowledge of the genetic organization or adenovirus, a 36 kb, linear, double-stranded DNA virus, allows substitution of large pieces of adenoviral DNA with foreign sequences up to 7 kb (Grunhaus and Horwitz, 1992).

b. AAV Vectors

The nucleic acid may be introduced into the cell using adenovirus assisted transfection. Increased transfection efficiencies have been reported in cell systems using adenovirus coupled systems (Kelleher and Vos, 1994; Cotten et al., 1992; Curiel, 1994). Adeno-associated virus (AAV) is an attractive vector system for use in the folate binding protein variant vaccines of the present invention as it has a high frequency of integration and it can infect nondividing cells, thus making it useful for delivery of genes into mammalian cells, for example, in tissue culture (Muzyczka, 1992) or in vivo. AAV has a broad host range for infectivity (Tratschin et al., 1984; Laughlin et al., 1986; Lebkowski et al., 1988; McLaughlin et al., 1988). Details concerning the generation and use of rAAV vectors are described in U.S. Pat. Nos. 5,139,941 and 4,797,368, each incorporated herein by reference.

c. Retroviral Vectors

Retroviruses have promise as folate binding protein variant antigen delivery vectors in vaccines due to their ability to integrate their genes into the host genome, transferring a large amount of foreign genetic material, infecting a broad spectrum of species and cell types and of being packaged in special cell lines (Miller, 1992).

In order to construct a folate binding protein variant vaccine retroviral vector, a nucleic acid (e.g., one encoding an folate binding protein variant antigen of interest) is inserted into the viral genome in the place of certain viral sequences to produce a virus that is replication-defective. In order to produce virions, a packaging cell line containing the gag, pol, and env genes but without the LTR and packaging components is constructed (Mann et al., 1983). When a recombinant plasmid containing a cDNA, together with the retroviral LTR and packaging sequences is introduced into a special cell line (e.g., by calcium phosphate precipitation for example), the packaging sequence allows the RNA transcript of the recombinant plasmid to be packaged into viral particles, which are then secreted into the culture media (Nicolas and Rubenstein, 1988; Temin, 1986; Mann et al., 1983). The media containing the recombinant retroviruses is then collected, optionally concentrated, and used for gene transfer. Retroviral vectors are able to infect a broad variety of cell types. However, integration and stable expression require the division of host cells (Paskind et al., 1975).

Lentiviruses are complex retroviruses, which, in addition to the common retroviral genes gag, pol, and env, contain other genes with regulatory or structural function. Lentiviral vectors are well known in the art (see, for example, Naldini et al., 1996; Zufferey et al., 1997; Blomer et al., 1997; U.S. Pat. Nos. 6,013,516 and 5,994,136). Some examples of lentivirus include the Human Immunodeficiency Viruses: HIV-1, HIV-2 and the Simian Immunodeficiency Virus: SIV. Lentiviral vectors have been generated by multiply attenuating the HIV virulence genes, for example, the genes env, vif, vpr, vpu and nef are deleted making the vector biologically safe.

Recombinant lentiviral vectors are capable of infecting non-dividing cells and can be used for both in vivo and ex vivo gene transfer and expression of nucleic acid sequences. For example, recombinant lentivirus capable of infecting a non-dividing cell wherein a suitable host cell is transfected with two or more vectors carrying the packaging functions, namely gag, pol and env, as well as rev and tat is described in U.S. Pat. No. 5,994,136, incorporated herein by reference. One may target the recombinant virus by linkage of the envelope protein with an antibody or a particular ligand for targeting to a receptor of a particular cell-type. By inserting a sequence (including a regulatory region) of interest into the viral vector, along with another gene which encodes the ligand for a receptor on a specific target cell, for example, the vector is now target-specific.

d. Other Viral Vectors

Other viral vectors may be employed as vaccine constructs in the present invention. Vectors derived from viruses such as vaccinia virus (Ridgeway, 1988; Baichwal and Sugden, 1986; Coupar et al., 1988), sindbis virus, cytomegalovirus and herpes simplex virus may be employed. They offer several attractive features for various mammalian cells (Friedmann, 1989; Ridgeway, 1988; Baichwal and Sugden, 1986; Coupar et al., 1988; Horwich et al., 1990).

e. Vaccine Delivery Using Modified Viruses

A nucleic acid to be delivered may be housed within an infective virus that has been engineered to express a specific binding ligand. The virus particle will thus bind specifically to the cognate receptors of the target cell and deliver the contents to the cell. A novel approach designed to allow specific targeting of retrovirus vectors was recently developed based on the chemical modification of a retrovirus by the chemical addition of lactose residues to the viral envelope. This modification can permit the specific infection of hepatocytes via sialoglycoprotein receptors.

Another approach to targeting of recombinant retroviruses was designed in which biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor were used. The antibodies were coupled via the biotin components by using streptavidin (Roux et al., 1989). Using antibodies against major histocompatibility complex class I and class II antigens, they demonstrated the infection of a variety of human cells that bore those surface antigens with an ecotropic virus in vitro (Roux et al., 1989). Thus, it is contemplated that antibodies, specific binding ligands and/or other targeting moieties may be used to specifically transfect APC types.

11. Vector Delivery and Cell Transformation

Suitable methods for nucleic acid delivery for transformation of an organelle, a cell, a tissue or an organism for use with the current invention are believed to include virtually any method by which a nucleic acid (e.g., DNA) can be introduced into an organelle, a cell, a tissue or an organism, as described herein or as would be known to one of ordinary skill in the art. Such methods include, but are not limited to, direct delivery of DNA such as by injection (U.S. Pat. Nos. 5,994,624, 5,981,274, 5,945,100, 5,780,448, 5,736,524, 5,702,932, 5,656,610, 5,589,466 and 5,580,859, each incorporated herein by reference), including microinjection (Harlan and Weintraub, 1985; U.S. Pat. No. 5,789,215, incorporated herein by reference); by electroporation (U.S. Pat. No. 5,384,253, incorporated herein by reference; Tur-Kaspa et al., 1986; Potter et al., 1984); by calcium phosphate precipitation (Graham and Van Der Eb, 1973; Chen and Okayama, 1987; Rippe et al., 1990); by using DEAE-dextran followed by polyethylene glycol (Gopal, 1985); by direct sonic loading (Fechheimer et al., 1987); by liposome mediated transfection (Nicolau and Sene, 1982; Fraley et al., 1979; Nicolau et al., 1987; Wong et al., 1980; Kaneda et al., 1989; Kato et al., 1991) and receptor-mediated transfection (Wu and Wu, 1987; Wu and Wu, 1988); by microprojectile bombardment (PCT Application Nos. WO 94/09699 and 95/06128; U.S. Pat. Nos. 5,610,042; 5,322,783 5,563,055, 5,550,318, 5,538,877 and 5,538,880, and each incorporated herein by reference); by agitation with silicon carbide fibers (Kaeppler et al., 1990; U.S. Pat. Nos. 5,302,523 and 5,464,765, each incorporated herein by reference); by Agrobacterium-mediated transformation (U.S. Pat. Nos. 5,591,616 and 5,563,055, each incorporated herein by reference); or by PEG-mediated transformation of protoplasts (Omirulleh et al., 1993; U.S. Pat. Nos. 4,684,611 and 4,952,500, each incorporated herein by reference); by desiccation/inhibition-mediated DNA uptake (Potrykus et al., 1985), and any combination of such methods. Through the application of techniques such as these, organelle(s), cell(s), tissue(s) or organism(s) may be stably or transiently transformed.

a. Injection

In certain embodiments, a nucleic acid may be delivered to an organelle, a cell, a tissue or an organism via one or more injections (i.e., a needle injection). Methods of injection of nucleic acids are described herein, and are well known to those of ordinary skill in the art. Further embodiments of the present invention include the introduction of a nucleic acid by direct microinjection to a cell. Direct microinjection has been used to introduce nucleic acid constructs into Xenopus oocytes (Harland and Weintraub, 1985). The amount of folate binding protein variant used may vary upon the nature of the antigen as well as the organelle, cell, tissue or organism used

b. Electroporation

In certain embodiments of the present invention, a nucleic acid is introduced into an organelle, a cell, a tissue or an organism via electroporation. Electroporation involves the exposure of a suspension of cells and DNA to a high-voltage electric discharge. In some variants of this method, certain cell wall-degrading enzymes, such as pectin-degrading enzymes, are employed to render the target recipient cells more susceptible to transformation by electroporation than untreated cells (U.S. Pat. No. 5,384,253, incorporated herein by reference). Alternatively, recipient cells can be made more susceptible to transformation by mechanical wounding.

Transfection of eukaryotic cells using electroporation has been quite successful. Mouse pre-B lymphocytes have been transfected with human kappa-immunoglobulin genes (Potter et al., 1984), and rat hepatocytes have been transfected with the chloramphenicol acetyltransferase gene (Tur-Kaspa et al., 1986) in this manner.

To effect transformation by electroporation in cells such as, for example, plant cells, one may employ either friable tissues, such as a suspension culture of cells or embryogenic callus or alternatively one may transform immature embryos or other organized tissue directly. In this technique, one would partially degrade the cell walls of the chosen cells by exposing them to pectin-degrading enzymes (pectolyases) or mechanically wounding in a controlled manner. Examples of some species which have been transformed by electroporation of intact cells include maize (U.S. Pat. No. 5,384,253; Rhodes et al., 1995; D'Halluin et al., 1992), wheat (Zhou et al., 1993), tomato (Hou and Lin, 1996), soybean (Christou et al., 1987) and tobacco (Lee et al., 1989).

One also may employ protoplasts for electroporation transformation of plant cells (Bates, 1994; Lazzeri, 1995). For example, the generation of transgenic soybean plants by electroporation of cotyledon-derived protoplasts is described by Dhir and Widholm in International Patent Application No. WO 9217598, incorporated herein by reference. Other examples of species for which protoplast transformation has been described include barley (Lazerri, 1995), sorghum (Battraw et al., 1991), maize (Bhattacharjee et al., 1997), wheat (He et al., 1994) and tomato (Tsukada, 1989).

c. Calcium Phosphate

In other embodiments of the present invention, a nucleic acid is introduced to the cells using calcium phosphate precipitation. Human KB cells have been transfected with adenovirus 5 DNA (Graham and Van Der Eb, 1973) using this technique. Also in this manner, mouse L(A9), mouse C127, CHO, CV-1, BHK, NIH3T3 and HeLa cells were transfected with a neomycin marker gene (Chen and Okayama, 1987), and rat hepatocytes were transfected with a variety of marker genes (Rippe et al., 1990).

d. DEAE-Dextran

In another embodiment, a nucleic acid is delivered into a cell using DEAE-dextran followed by polyethylene glycol. In this manner, reporter plasmids were introduced into mouse myeloma and erythroleukemia cells (Gopal, 1985).

e. Liposome-Mediated Transfection

In a further embodiment of the invention, one or more vaccine components or nucleic acids may be entrapped in a lipid complex such as, for example, a liposome. Liposomes are vesicular structures characterized by a phospholipid bilayer membrane and an inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated by aqueous medium. They form spontaneously when phospholipids are suspended in an excess of aqueous solution. The lipid components undergo self-rearrangement before the formation of closed structures and entrap water and dissolved solutes between the lipid bilayers (Ghosh and Bachhawat, 1991). Also contemplated is an nucleic acid complexed with Lipofectamine (Gibco BRL) or Superfect (Qiagen).

Liposome-mediated nucleic acid delivery and expression of foreign DNA in vitro has been very successful (Nicolau and Sene, 1982; Fraley et al., 1979; Nicolau et al., 1987). The feasibility of liposome-mediated delivery and expression of foreign DNA in cultured chick embryo, HeLa and hepatoma cells has also been demonstrated (Wong et al., 1980).

In certain embodiments of the invention, a liposome may be complexed with a hemagglutinating virus (HVJ). This has been shown to facilitate fusion with the cell membrane and promote cell entry of liposome-encapsulated DNA (Kaneda et al., 1989). In other embodiments, a liposome may be complexed or employed in conjunction with nuclear non-histone chromosomal proteins (HMG-1) (Kato et al., 1991). In yet further embodiments, a liposome may be complexed or employed in conjunction with both HVJ and HMG-1. In other embodiments, a delivery vehicle may comprise a ligand and a liposome.

f. Receptor Mediated Transfection

One or more vaccine components or nucleic acids, may be employed to delivered using a receptor-mediated delivery vehicle. These take advantage of the selective uptake of macromolecules by receptor-mediated endocytosis that will be occurring in the target cells. In view of the cell type-specific distribution of various receptors, this delivery method adds another degree of specificity to the present invention. Specific delivery in the context of another mammalian cell type has been described (Wu and Wu, 1993, incorporated herein by reference).

Certain receptor-mediated gene targeting vehicles comprise a cell receptor-specific ligand and a nucleic acid-binding agent. Others comprise a cell receptor-specific ligand to which the nucleic acid to be delivered has been operatively attached. Several ligands have been used for receptor-mediated gene transfer (Wu and Wu, 1987; Wagner et al., 1990; Perales et al., 1994; Myers, EPO 0273085), which establishes the operability of the technique. Specific delivery in the context of another mammalian cell type has been described (Wu and Wu, 1993; incorporated herein by reference). In certain aspects of the present invention, a ligand will be chosen to correspond to a receptor specifically expressed on the target cell population.

In other embodiments, a nucleic acid delivery vehicle component of a cell-specific nucleic acid targeting vehicle may comprise a specific binding ligand in combination with a liposome. The nucleic acid(s) to be delivered are housed within the liposome and the specific binding ligand is functionally incorporated into the liposome membrane. The liposome will thus specifically bind to the receptor(s) of a target cell and deliver the contents to a cell. Such systems have been shown to be functional using systems in which, for example, epidermal growth factor (EGF) is used in the receptor-mediated delivery of a nucleic acid to cells that exhibit upregulation of the EGF receptor.

In still further embodiments, the nucleic acid delivery vehicle component of a targeted delivery vehicle may be a liposome itself, which will preferably comprise one or more lipids or glycoproteins that direct cell-specific binding. For example, lactosyl-ceramide, a galactose-terminal asialganglioside, have been incorporated into liposomes and observed an increase in the uptake of the insulin gene by hepatocytes (Nicolau et al., 1987). It is contemplated that the tissue-specific transforming constructs of the present invention can be specifically delivered into a target cell in a similar manner.

g. Microprojectile Bombardment

Microprojectile bombardment techniques can be used to introduce a nucleic acid into at least one, organelle, cell, tissue or organism (U.S. Pat. No. 5,550,318; U.S. Pat. No. 5,538,880; U.S. Pat. No. 5,610,042; and PCT Application WO 94/09699; each of which is incorporated herein by reference). This method depends on the ability to accelerate DNA-coated microprojectiles to a high velocity allowing them to pierce cell membranes and enter cells without killing them (Klein et al., 1987). There are a wide variety of microprojectile bombardment techniques known in the art, many of which are applicable to the invention.

Microprojectile bombardment may be used to transform various cell(s), tissue(s) or organism(s), such as for example any plant species. Examples of species which have been transformed by microprojectile bombardment include monocot species such as maize (PCT Application WO 95/06128), barley (Ritala et al., 1994; Hensgens et al., 1993), wheat (U.S. Pat. No. 5,563,055, incorporated herein by reference), rice (Hensgens et al., 1993), oat (Torbet et al., 1995; Torbet et al., 1998), rye (Hensgens et al., 1993), sugarcane (Bower et al., 1992), and sorghum (Casas et al., 1993; Hagio et al., 1991); as well as a number of dicots including tobacco (Tomes et al., 1990; Buising and Benbow, 1994), soybean (U.S. Pat. No. 5,322,783, incorporated herein by reference), sunflower (Knittel et al. 1994), peanut (Singsit et al., 1997), cotton (McCabe and Martinell, 1993), tomato (VanEck et al. 1995), and legumes in general (U.S. Pat. No. 5,563,055, incorporated herein by reference).

In this microprojectile bombardment, one or more particles may be coated with at least one nucleic acid and delivered into cells by a propelling force. Several devices for accelerating small particles have been developed. One such device relies on a high voltage discharge to generate an electrical current, which in turn provides the motive force (Yang et al., 1990). The microprojectiles used have consisted of biologically inert substances such as tungsten or gold particles or beads. Exemplary particles include those comprised of tungsten, platinum, and preferably, gold. It is contemplated that in some instances DNA precipitation onto metal particles would not be necessary for DNA delivery to a recipient cell using microprojectile bombardment. However, it is contemplated that particles may contain DNA rather than be coated with DNA. DNA-coated particles may increase the level of DNA delivery via particle bombardment but are not, in and of themselves, necessary.

For the bombardment, cells in suspension are concentrated on filters or solid culture medium. Alternatively, immature embryos or other target cells may be arranged on solid culture medium. The cells to be bombarded are positioned at an appropriate distance below the macroprojectile stopping plate.

12. Host Cells

As used herein, the terms “cell,” “cell line,” and “cell culture” may be used interchangeably. All of these terms also include their progeny, which is any and all subsequent generations. It is understood that all progeny may not be identical due to deliberate or inadvertent mutations. In the context of expressing a heterologous nucleic acid sequence, “host cell” refers to a prokaryotic or eukaryotic cell, and it includes any transformable organisms that is capable of replicating a vector and/or expressing a heterologous gene encoded by a vector. A host cell can, and has been, used as a recipient for vectors. A host cell may be “transfected” or “transformed,” which refers to a process by which exogenous nucleic acid is transferred or introduced into the host cell. A transformed cell includes the primary subject cell and its progeny. As used herein, the terms “engineered” and “recombinant” cells or host cells are intended to refer to a cell into which an exogenous nucleic acid sequence, such as, for example, a vector, has been introduced. Therefore, recombinant cells are distinguishable from naturally occurring cells which do not contain a recombinantly introduced nucleic acid.

In certain embodiments, it is contemplated that RNAs or proteinaceous sequences may be co-expressed with other selected RNAs or proteinaceous sequences in the same host cell. Co-expression may be achieved by co-transfecting the host cell with two or more distinct recombinant vectors. Alternatively, a single recombinant vector may be constructed to include multiple distinct coding regions for RNAs, which could then be expressed in host cells transfected with the single vector.

A tissue may comprise a host cell or cells to be transformed with a folate binding protein variant. The tissue may be part or separated from an organism. In certain embodiments, a tissue may comprise, but is not limited to, adipocytes, alveolar, ameloblasts, axon, basal cells, blood (e.g., lymphocytes), blood vessel, bone, bone marrow, brain, breast, cartilage, cervix, colon, cornea, embryonic, endometrium, endothelial, epithelial, esophagus, facia, fibroblast, follicular, ganglion cells, glial cells, goblet cells, kidney, liver, lung, lymph node, muscle, neuron, ovaries, pancreas, peripheral blood, prostate, skin, skin, small intestine, spleen, stem cells, stomach, testes, anthers, ascite tissue, cobs, ears, flowers, husks, kernels, leaves, meristematic cells, pollen, root tips, roots, silk, stalks, and all cancers thereof.

In certain embodiments, the host cell or tissue may be comprised in at least one organism. In certain embodiments, the organism may be, but is not limited to, a prokayote (e.g, a eubacteria, an archaea) or an eukaryote, as would be understood by one of ordinary skill in the art.

Numerous cell lines and cultures are available for use as a host cell, and they can be obtained through the American Type Culture Collection (ATCC), which is an organization that serves as an archive for living cultures and genetic materials. An appropriate host can be determined by one of skill in the art based on the vector backbone and the desired result. A plasmid or cosmid, for example, can be introduced into a prokaryote host cell for replication of many vectors. Cell types available for vector replication and/or expressioninclude, but are not limited to, bacteria, such as E. coli (e.g, E. coli strain RR1, E. coli LE392, E. coli B, E. coli X 1776 (ATCC No. 31537) as well as E. coli W3110 (F′, lambda, prototrophic, ATCC No. 273325), bacilli such as Bacillus subtilis; and other enterobacteriaceaeo such as Salmonella typhimurium, Serratia marcescens, various Pseudomonas specie, DH5a, JM109, and KC8, as well as a number of commercially available bacterial hosts such as SURE® Competent Cells and SOLOPACKä Gold Cells (STRATAGENE®, La Jolla). In certain embodiments, bacterial cells such as E. coli LE392 are particularly contemplated as host cells for phage viruses.

Examples of eukaryotic host cells for replication and/or expression of a vector include, but are not limited to, HeLa, NIH3T3, Jurkat, 293, Cos, CHO, Saos, and PC12. Many host cells from various cell types and organisms are available and would be known to one of skill in the art. Similarly, a viral vector may be used in conjunction with either a eukaryotic or prokaryotic host cell, particularly one that is permissive for replication or expression of the vector.

Some vectors may employ control sequences that allow it to be replicated and/or expressed in both prokaryotic and eukaryotic cells. One of skill in the art would further understand the conditions under which to incubate all of the above described host cells to maintain them and to permit replication of a vector. Also understood and known are techniques and conditions that would allow large-scale production of vectors, as well as production of the nucleic acids encoded by vectors and their cognate polypeptides, proteins, or peptides.

13. Expression Systems

Numerous expression systems exist that comprise at least a part or all of the compositions discussed above. Prokaryote- and/or eukaryote-based systems can be employed for use with the present invention to produce nucleic acid sequences, or their cognate polypeptides, proteins and peptides. Many such systems are commercially and widely available.

The insect cellibaculovirus system can produce a high level of protein expression of a heterologous nucleic acid segment, such as described in U.S. Pat. Nos. 5,871,986, 4,879,236, both herein incorporated by reference, and which can be bought, for example, under the name MAXBAC® 2.0 from INVITROGEN® and BACPACK™ BACULOVIRUS EXPRESSION SYSTEM FROM CLONTECH®.

Other examples of expression systems include STRATAGENE®'s COMPLETE CONTROLä Inducible Mammalian Expression System, which involves a synthetic ecdysone-inducible receptor, or its pET Expression System, an E. coli expression system. Another example of an inducible expression system is available from INVITROGEN®, which carries the T-REX™ (tetracycline-regulated expression) System, an inducible mammalian expression system that uses the full-length CMV promoter. INVITROGEN® also provides a yeast expression system called the Pichia methanolica Expression System, which is designed for high-level production of recombinant proteins in the methylotrophic yeast Pichia methanolica. One of skill in the art would know how to express a vector, such as an expression construct, to produce a nucleic acid sequence or its cognate polypeptide, protein, or peptide.

It is contemplated that the proteins, polypeptides or peptides produced by the methods of the invention may be “overexpressed”, i.e., expressed in increased levels relative to its natural expression in cells. Such overexpression may be assessed by a variety of methods, including radiolabeling and/or protein purification. However, simple and direct methods are preferred, for example, those involving SDS/PAGE and protein staining or western blotting, followed by quantitative analyses, such as densitometric scanning of the resultant gel or blot. A specific increase in the level of the recombinant protein, polypeptide or peptide in comparison to the level in natural cells is indicative of overexpression, as is a relative abundance of the specific protein, polypeptides or peptides in relation to the other proteins produced by the host cell and, e.g., visible on a gel.

In some embodiments, the expressed proteinaceous sequence forms an inclusion body in the host cell, the host cells are lysed, for example, by disruption in a cell homogenizer, washed and/or centrifuged to separate the dense inclusion bodies and cell membranes from the soluble cell components. This centrifugation can be performed under conditions whereby the dense inclusion bodies are selectively enriched by incorporation of sugars, such as sucrose, into the buffer and centrifugation at a selective speed. Inclusion bodies may be solubilized in solutions containing high concentrations of urea (e.g. 8 M) or chaotropic agents such as guanidine hydrochloride in the presence of reducing agents, such as β-mercaptoethanol or DTT (dithiothreitol), and refolded into a more desirable conformation, as would be known to one of ordinary skill in the art.

G. Vaccine Component Purification

In any case, a vaccine component (e.g., an antigenic peptide or polypeptide or nucleic acid encoding a proteinaceous composition) may be isolated and/or purified from the chemical synthesis reagents, cell or cellular components. In a method of producing the vaccine component, purification is accomplished by any appropriate technique that is described herein or well known to those of skill in the art (e.g., Sambrook et al., 1987). Although preferred for use in certain embodiments, there is no general requirement that an antigenic composition of the present invention or other vaccine component always be provided in their most purified state. Indeed, it is contemplated that a less substantially purified vaccine component, which is nonetheless enriched in the desired compound, relative to the natural state, will have utility in certain embodiments, such as, for example, total recovery of protein product, or in maintaining the activity of an expressed protein. However, it is contemplate that inactive products also have utility in certain embodiments, such as, e.g., in determining antigenicity via antibody generation.

The present invention also provides purified, and in preferred embodiments, substantially purified vaccines or vaccine components. The term “purified vaccine component” as used herein, is intended to refer to at least one vaccine component (e.g., a proteinaceous composition, isolatable from cells), wherein the component is purified to any degree relative to its naturally-obtainable state, e.g., relative to its purity within a cellular extract or reagents of chemical synthesis. In certain aspects wherein the vaccine component is a proteinaceous composition, a purified vaccine component also refers to a wild-type or mutant protein, polypeptide, or peptide free from the environment in which it naturally occurs.

Where the term “substantially purified” is used, this will refer to a composition in which the specific compound (e.g., a protein, polypeptide, or peptide) forms the major component of the composition, such as constituting about 50% of the compounds in the composition or more. In preferred embodiments, a substantially purified vaccine component will constitute more than about 60%, about 70%, about 80%, about 90%, about 95%, about 99% or even more of the compounds in the composition.

In certain embodiments, a vaccine component may be purified to homogeneity. As applied to the present invention, “purified to homogeneity,” means that the vaccine component has a level of purity where the compound is substantially free from other chemicals, biomolecules or cells. For example, a purified peptide, polypeptide or protein will often be sufficiently free of other protein components so that degradative sequencing may be performed successfully. Various methods for quantifying the degree of purification of a vaccine component will be known to those of skill in the art in light of the present disclosure. These include, for example, determining the specific protein activity of a fraction (e.g., antigenicity), or assessing the number of polypeptides within a fraction by gel electrophoresis.

Various techniques suitable for use in chemical, biomolecule or biological purification, well known to those of skill in the art, may be applicable to preparation of a vaccine component of the present invention. These include, for example, precipitation with ammonium sulfate, PEG, antibodies and the like or by heat denaturation, followed by centrifugation; fractionation, chromatographic procedures, including but not limited to, partition chromatograph (e.g., paper chromatograph, thin-layer chromatograph (TLC), gas-liquid chromatography and gel chromatography) gas chromatography, high performance liquid chromatography, affinity chromatography, supercritical flow chromatography ion exchange, gel filtration, reverse phase, hydroxylapatite, lectin affinity; isoelectric focusing and gel electrophoresis (see for example, Sambrook et al. 1989; and Freifelder, Physical Biochemistry, Second Edition, pages 238-246, incorporated herein by reference).

Given many DNA and proteins are known (see for example, the National Center for Biotechnology Information's Genbank® and GenPept databases, or may be identified and amplified using the methods described herein, any purification method for recombinately expressed nucleic acid or proteinaceous sequences known to those of skill in the art can now be employed. In certain aspects, a nucleic acid may be purified on polyacrylamide gels, and/or cesium chloride centrifugation gradients, or by any other means known to one of ordinary skill in the art (see for example, Sambrook et al. 1989, incorporated herein by reference). In further aspects, a purification of a proteinaceous sequence may be conducted by recombinately expressing the sequence as a fusion protein. Such purification methods are routine in the art. This is exemplified by the generation of an specific protein-glutathione S-transferase fusion protein, expression in E. coli, and isolation to homogeneity using affinity chromatography on glutathione-agarose or the generation of a polyhistidine tag on the N- or C-terminus of the protein, and subsequent purification using Ni-affinity chromatography. In particular aspects, cells or other components of the vaccine may be purified by flow cytometry. Flow cytometry involves the separation of cells or other particles in a liquid sample, and is well known in the art (see, for example, U.S. Pat. Nos. 3,826,364, 4,284,412, 4,989,977, 4,498,766, 5,478,722, 4,857,451, 4,774,189, 4,767,206, 4,714,682, 5,160,974 and 4,661,913). Any of these techniques described herein, and combinations of these and any other techniques known to skilled artisans, may be used to purify and/or assay the purity of the various chemicals, proteinaceous compounds, nucleic acids, cellular materials and/or cells that may comprise a vaccine of the present invention. As is generally known in the art, it is believed that the order of conducting the various purification steps may be changed, or that certain steps may be omitted, and still result in a suitable method for the preparation of a substantially purified antigen or other vaccine component.

H. Additional Vaccine Components

It is contemplated that an antigenic composition of the invention may be combined with one or more additional components to form a more effective vaccine. Non-limiting examples of additional components include, for example, one or more additional antigens, immunomodulators or adjuvants to stimulate an immune response to an antigenic composition of the present invention and/or the additional component(s).

1. Immunomodulators

For example, it is contemplated that immunomodulators can be included in the vaccine to augment a cell's or a patient's (e.g., an animal's) response. Immunomodulators can be included as purified proteins, nucleic acids encoding immunomodulators, and/or cells that express immunomodulators in the vaccine composition. The following sections list non-limiting examples of immunomodulators that are of interest, and it is contemplated that various combinations of immunomodulators may be used in certain embodiments (e.g., a cytokine and a chemokine).

In another aspects of the invention, it is contemplated that the folate binding protein variant composition may further comprise a therapeutically effective composition of an immunomodulator. It is envisioned that an immunomodulator would constitute a cytokine, hematapoietin, colony stimulating factor, interleukin, interferon, growth factor or combination thereof. As used herein certain embodiments, the terms “cytokine” are the same as described in U.S. Pat. No. 5,851,984, incorporated herein by reference in its entirety, which reads in relevant part:

“The term ‘cytokine’ is a generic term for proteins released by one cell population which act on another cell as intercellular mediators. Examples of such cytokines are lymphokines, monokines, growth factors and traditional polypeptide hormones. Included among the cytokines are growth hormones such as human growth hormone, N-methionyl human growth hormone, and bovine growth hormone; parathyroid hormone; thyroxine; insulin; proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), thyroid stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; prostaglandin, fibroblast growth factor; prolactin; placental lactogen, OB protein; tumor necrosis factor-.alpha. and -.beta.; mullerian-inhibiting substance; mouse gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; integrin; thrombopoietin (TPO); nerve growth factors such as NGF-.beta.; platelet-growth factor; transforming growth factors (TGFs) such as TGF-.alpha. and TGF-.beta.; insulin-like growth factor-I and -II; erythropoietin (EPO); osteoinductive factors; interferons such as interferon-a, -.b, and -g; colony stimulating factors (CSFs) such as macrophage-CSF (M-CSF); granulocyte-macrophage-CSF (GM-CSF); and granulocyte-CSF (G-CSF); interleukins (ILs) such as IL-1, IL-1.alpha., IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-11, IL-12; IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, LIF, G-CSF, GM-CSF, M-CSF, EPO, kit-ligand or FLT-3. As used herein, the term cytokine includes proteins from natural sources or from recombinant cell culture and biologically active equivalents of the native sequence cytokines.

a. β-interferon

β-interferon (IFN-b) is low molecular weight protein that is produced by many cell types, including epithelial cells, fibroblasts and macrophages. Cells that express endogenous IFN-b are resistant to viral infection and replication. The b-interferon genes from mouse (GenBank® accession numbers X14455, X14029) and human (GenBank® accession numbers J00218, K00616 and M11029) have been isolated and sequenced. IFN-b is a multifunctional glycoprotein that can inhibit tumor growth both directly, by suppressing cell replication and inducing differentiation or apoptosis and indirectly by activating tumoricidal properties of macrophages and NK cells, by suppressing tumor angiogenesis and by stimulating specific immune response.

b. Interleukin-2

Interleukin-2 (IL-2), originally designated T-cell growth factor I, is a highly proficient inducer of T-cell proliferation and is a growth factor for all subpopulations of T-lymphocytes. IL-2 is an antigen independent proliferation factor that induces cell cycle progression in resting cells and thus allows clonal expansion of activated T-lymphocytes. Since freshly isolated leukemic cells also secrete IL2 and respond to it IL2 may function as an autocrine growth modulator for these cells capable of worsening ATL. IL2 also promotes the proliferation of activated B-cells although this requires the presence of additional factors, for example, IL4. In vitro IL2 also stimulates the growth of oligodendroglial cells. Due to its effects on T-cells and B-cells IL2 is a central regulator of immune responses. It also plays a role in anti-inflammatory reactions, in hematopoiesis and in tumor surveillance. IL-2 stimulates the synthesis of IFN-g in peripheral leukocytes and also induces the secretion of IL-1, TNF-a and TNF-b. The induction of the secretion of tumoricidal cytokines, apart from the activity in the expansion of LAK cells, (lymphokine-activated killer cells) are probably the main factors responsible for the antitumor activity of IL2.

c. GM-CSF

GM-CSF stimulates the proliferation and differentiation of neutrophilic, eosinophilic, and monocytic lineages. It also functionally activates the corresponding mature forms, enhancing, for example, to the expression of certain cell surface adhesion proteins (CD-11A, CD-11C). The overexpression of these proteins could be one explanation for the observed local accumulation of granulocytes at sites of inflammation. In addition, GM-CSF also enhances expression of receptors for FMLP (Formyl-Met-Leu-Phe) which is a stimulator of neutrophil activity.

d. Cytokines

Interleukins, cytokines, nucleic acids encoding interleukins or cytokines, and/or cells expressing such compounds are contemplated as possible vaccine components. Interleukins and cytokines, include but are not limited to interleukin 1 (IL-1), IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-18, β-interferon, α-interferon, γ-interferon, angiostatin, thrombospondin, endostatin, GM-CSF, G-CSF, M-CSF, METH-1, METH-2, tumor necrosis factor, TGFb, LT and combinations thereof.

e. Chemokines

Chemokines, nucleic acids that encode for chemokines, and/or cells that express such also may be used as vaccine components. Chemokines generally act as chemoattractants to recruit immune effector cells to the site of chemokine expression. It may be advantageous to express a particular chemokine coding sequence in combination with, for example, a cytokine coding sequence, to enhance the recruitment of other immune system components to the site of treatment. Such chemokines include, for example, RANTES, MCAF, MIP1-alpha, MIP1-Beta, IP-10 and combinations thereof. The skilled artisan will recognize that certain cytokines are also known to have chemoattractant effects and could also be classified under the term chemokines.

f. Immunogenic Carrier Proteins

In certain embodiments, an antigenic composition's may be chemically coupled to a carrier or recombinantly expressed with a immunogenic carrier peptide or polypetide (e.g., a antigen-carrier fusion peptide or polypeptide) to enhance an immune reaction. Exemplary and preferred immunogenic carrier amino acid sequences include hepatitis B surface antigen, keyhole limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other albumins such as ovalbumin, mouse serum albumin or rabbit serum albumin also can be used as immunogenic carrier proteins. Means for conjugating a polypeptide or peptide to a immunogenic carrier protein are well known in the art and include, for example, glutaraldehyde, m-maleimidobenzoyl-N-hydroxysuccinimide ester, carbodiimide and bis-biazotized benzidine.

g. Biological Response Modifiers

It may be desirable to coadminister biologic response modifiers (BRM), which have been shown to upregulate T cell immunity or downregulate suppressor cell activity. Such BRMs include, but are not limited to, cimetidine (CIM; 1200 mg/d) (Smith/Kline, PA); low-dose cyclophosphamide (CYP; 300 mg/m2) (Johnson/Mead, NJ), or a gene encoding a protein involved in one or more immune helper functions, such as B-7.

2. Adjuvants

Immunization protocols have used adjuvants to stimulate responses for many years, and as such adjuvants are well known to one of ordinary skill in the art. Some adjuvants affect the way in which antigens are presented. For example, the immune response is increased when protein antigens are precipitated by alum. Emulsification of antigens also prolongs the duration of antigen presentation.

In one aspect, an adjuvant effect is achieved by use of an agent such as alum used in about 0.05 to about 0.1% solution in phosphate buffered saline. Alternatively, the antigen is made as an admixture with synthetic polymers of sugars (Carbopol®) used as an about 0.25% solution. Adjuvant effect may also be made my aggregation of the antigen in the vaccine by heat treatment with temperatures ranging between about 70° to about 101° C. for a 30-second to 2-minute period, respectively. Aggregation by reactivating with pepsin treated (Fab) antibodies to albumin, mixture with bacterial cell(s) such as C. parvum or an endotoxin or a lipopolysaccharide components of Gram-negative bacteria, emulsion in physiologically acceptable oil vehicles such as mannide mono-oleate (Aracel A) or emulsion with a 20% solution of a perfluorocarbon (Fluosol-DA®) used as a block substitute also may be employed.

Some adjuvants, for example, are certain organic molecules obtained from bacteria, act on the host rather than on the antigen. An example is muramyl dipeptide (N-acetylmuramyl-L-alanyl-D-isoglutamine [MDP]), a bacterial peptidoglycan. The effects of MDP, as with most adjuvants, are not fully understood. MDP stimulates macrophages but also appears to stimulate B cells directly. The effects of adjuvants, therefore, are not antigen-specific. If they are administered together with a purified antigen, however, they can be used to selectively promote the response to the antigen.

Adjuvants have been used experimentally to promote a generalized increase in immunity against unknown antigens (e.g., U.S. Pat. No. 4,877,611). This has been attempted particularly in the treatment of cancer. For many cancers, there is compelling evidence that the immune system participates in host defense against the tumor cells, but only a fraction of the likely total number of tumor-specific antigens are believed to have been identified to date. However, using the present invention, the inclusion of a suitable adjuvant into the membrane of an irradiated tumor cell will likely increase the anti-tumor response irrespective of the molecular identification of the prominent antigens. This is a particularly important and time-saving feature of the invention.

In certain embodiments, hemocyanins and hemoerythrins may also be used in the invention. The use of hemocyanin from keyhole limpet (KLH) is preferred in certain embodiments, although other molluscan and arthropod hemocyanins and hemoerythrins may be employed.

Various polysaccharide adjuvants may also be used. For example, the use of various pneumococcal polysaccharide adjuvants on the antibody responses of mice has been described (Yin et al., 1989). The doses that produce optimal responses, or that otherwise do not produce suppression, should be employed as indicated (Yin et al., 1989). Polyamine varieties of polysaccharides are particularly preferred, such as chitin and chitosan, including deacetylated chitin.

Another group of adjuvants are the muramyl dipeptide (MDP, N-acetylmuramyl-L-alanyl-D-isoglutamine) group of bacterial peptidoglycans. Derivatives of muramyl dipeptide, such as the amino acid derivative threonyl-MDP, and the fatty acid derivative MTPPE, are also contemplated.

U.S. Pat. No. 4,950,645 describes a lipophilic disaccharide-tripeptide derivative of muramyl dipeptide which is described for use in artificial liposomes formed from phosphatidyl choline and phosphatidyl glycerol. It is the to be effective in activating human monocytes and destroying tumor cells, but is non-toxic in generally high doses. The compounds of U.S. Pat. No. 4,950,645 and PCT Patent Application WO 91/16347, are contemplated for use with cellular carriers and other embodiments of the present invention.

Another adjuvant contemplated for use in the present invention is BCG. BCG (bacillus Calmette-Guerin, an attenuated strain of Mycobacterium) and BCG-cell wall skeleton (CWS) may also be used as adjuvants in the invention, with or without trehalose dimycolate. Trehalose dimycolate may be used itself. Trehalose dimycolate administration has been shown to correlate with augmented resistance to influenza virus infection in mice (Azuma et al., 1988). Trehalose dimycolate may be prepared as described in U.S. Pat. No. 4,579,945.

BCG is an important clinical tool because of its immunostimulatory properties. BCG acts to stimulate the reticulo-endothelial system, activates natural killer cells and increases proliferation of hematopoietic stem cells. Cell wall extracts of BCG have proven to have excellent immune adjuvant activity. Molecular genetic tools and methods for mycobacteria have provided the means to introduce foreign genes into BCG (Jacobs et al., 1987; Snapper et al., 1988; Husson et al., 1990; Martin et al., 1990).

Live BCG is an effective and safe vaccine used worldwide to prevent tuberculosis. BCG and other mycobacteria are highly effective adjuvants, and the immune response to mycobacteria has been studied extensively. With nearly 2 billion immunizations, BCG has a long record of safe use in man (Luelmo, 1982; Lotte et al., 1984). It is one of the few vaccines that can be given at birth, it engenders long-lived immune responses with only a single dose, and there is a worldwide distribution network with experience in BCG vaccination. An exemplary BCG vaccine is sold as TICE™ BCG (Organon Inc., West Orange, N.J.).

In a typical practice of the present invention, cells of Mycobacterium bovis-BCG are grown and harvested by methods known in the art. For example, they may be grown as a surface pellicle on a Sauton medium or in a fermentation vessel containing the dispersed culture in a Dubos medium (Dubos et al., 1947; Rosenthal, 1937). All the cultures are harvested after 14 days incubation at about 37° C. Cells grown as a pellicle are harvested by using a platinum loop whereas those from the fermenter are harvested by centrifugation or tangential-flow filtration. The harvested cells are resuspended in an aqueous sterile buffer medium. A typical suspension contains from about 2×10¹⁰ cells/ml to about 2×10¹² cells/ml. To this bacterial suspension, a sterile solution containing a selected enzyme which will degrade the BCG cell covering material is added. The resultant suspension is agitated such as by stirring to ensure maximal dispersal of the BCG organisms. Thereafter, a more concentrated cell suspension is prepared and the enzyme in the concentrate removed, typically by washing with an aqueous buffer, employing known techniques such as tangential-flow filtration. The enzyme-free cells are adjusted to an optimal immunological concentration with a cryoprotectant solution, after which they are filled into vials, ampoules, etc., and lyophilized, yielding BCG vaccine, which upon reconstitution with water is ready for immunization.

Amphipathic and surface active agents, e.g., saponin and derivatives such as QS21 (Cambridge Biotech), form yet another group of adjuvants for use with the immunogens of the present invention. Nonionic block copolymer surfactants (Rabinovich et al., 1994; Hunter et al., 1991) may also be employed. Oligonucleotides are another useful group of adjuvants (Yamamoto et al., 1988). Quil A and lentinen are other adjuvants that may be used in certain embodiments of the present invention.

One group of adjuvants preferred for use in the invention are the detoxified endotoxins, such as the refined detoxified endotoxin of U.S. Pat. No. 4,866,034. These refined detoxified endotoxins are effective in producing adjuvant responses in mammals. Of course, the detoxified endotoxins may be combined with other adjuvants to prepare multi-adjuvant-incorporated cells. For example, combination of detoxified endotoxins with trehalose dimycolate is particularly contemplated, as described in U.S. Pat. No. 4,435,386. Combinations of detoxified endotoxins with trehalose dimycolate and endotoxic glycolipids is also contemplated (U.S. Pat. No. 4,505,899), as is combination of detoxified endotoxins with cell wall skeleton (CWS) or CWS and trehalose dimycolate, as described in U.S. Pat. Nos. 4,436,727, 4,436,728 and 4,505,900. Combinations of just CWS and trehalose dimycolate, without detoxified endotoxins, is also envisioned to be useful, as described in U.S. Pat. No. 4,520,019.

In other embodiments, the present invention contemplates that a variety of adjuvants may be employed in the membranes of cells, resulting in an improved immunogenic composition. The only requirement is, generally, that the adjuvant be capable of incorporation into, physical association with, or conjugation to, the cell membrane of the cell in question. Those of skill in the art will know the different kinds of adjuvants that can be conjugated to cellular vaccines in accordance with this invention and these include alkyl lysophosphilipids (ALP); BCG; and biotin (including biotinylated derivatives) among others. Certain adjuvants particularly contemplated for use are the teichoic acids from Gram positive cells. These include the lipoteichoic acids (LTA), ribitol teichoic acids (RTA) and glycerol teichoic acid (GTA). Active forms of their synthetic counterparts may also be employed in connection with the invention (Takada et al., 1995a).

Various adjuvants, even those that are not commonly used in humans, may still be employed in animals, where, for example, one desires to raise antibodies or to subsequently obtain activated T cells. The toxicity or other adverse effects that may result from either the adjuvant or the cells, e.g., as may occur using non-irradiated tumor cells, is irrelevant in such circumstances.

One group of adjuvants preferred for use in some embodiments of the present invention are those that can be encoded by a nucleic acid (e.g., DNA or RNA). It is contemplated that such adjuvants may be encoded in a nucleic acid (e.g., an expression vector) encoding the antigen, or in a separate vector or other construct. These nucleic acids encoding the adjuvants can be delivered directly, such as for example with lipids or liposomes.

3. Excipients, Salts and Auxiliary Substances

An antigenic composition of the present invention may be mixed with one or more additional components (e.g., excipients, salts, etc.) which are pharmaceutically acceptable and compatible with at least one active ingredient (e.g., antigen). Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol and combinations thereof.

An antigenic composition of the present invention may be formulated into the vaccine as a neutral or salt form. A pharmaceutically-acceptable salt, includes the acid addition salts (formed with the free amino groups of the peptide) and those which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acid, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. A salt formed with a free carboxyl group also may be derived from an inorganic base such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxide, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and combinations thereof.

In addition, if desired, an antigentic composition may comprise minor amounts of one or more auxiliary substances such as for example wetting or emulsifying agents, pH buffering agents, etc. which enhance the effectiveness of the antigenic composition or vaccine.

I. Vaccine Preparations

Once produced, synthesized and/or purified, an antigen or other vaccine component may be prepared as a vaccine for administration to a patient. The preparation of a vaccine is generally well understood in the art, as exemplified by U.S. Pat. Nos. 4,608,251, 4,601,903, 4,599,231, 4,599,230, and 4,596,792, all incorporated herein by reference. Such methods may be used to prepare a vaccine comprising an antigenic composition comprising folate binding protein epitopes and/or variants as active ingredient(s), in light of the present disclosure. In preferred embodiments, the compositions of the present invention are prepared to be pharmacologically acceptable vaccines.

Pharmaceutical vaccine compositions of the present invention comprise an effective amount of one or more folate binding protein epitopes and/or variants or additional agent dissolved or dispersed in a pharmaceutically acceptable carrier. The phrases “pharmaceutical or pharmacologically acceptable” refers to molecular entities and compositions that do not produce an adverse, allergic or other untoward reaction when administered to an animal, such as, for example, a human, as appropriate. The preparation of an pharmaceutical composition that contains at least one folate binding protein epitope or additional active ingredient will be known to those of skill in the art in light of the present disclosure, as exemplified by Remington's Pharmaceutical Sciences, 18th Ed. Mack Printing Company, 1990, incorporated herein by reference. Moreover, for animal (e.g., human) administration, it will be understood that preparations should meet sterility, pyrogenicity, general safety and purity standards as required by FDA Office of Biological Standards.

As used herein, “pharmaceutically acceptable carrier” includes any and all solvents, dispersion media, coatings, surfactants, antioxidants, preservatives (e.g., antibacterial agents, antifungal agents), isotonic agents, absorption delaying agents, salts, preservatives, drugs, drug stabilizers, binders, excipients, disintegration agents, lubricants, sweetening agents, flavoring agents, dyes, such like materials and combinations thereof, as would be known to one of ordinary skill in the art (see, for example, Remington's Pharmaceutical Sciences, 18th Ed. Mack Printing Company, 1990, pp. 1289-1329, incorporated herein by reference). The folate binding protein variant may comprise different types of carriers depending on whether it is to be administered in solid, liquid or aerosol form, and whether it need to be sterile for such routes of administration as injection. Except insofar as any conventional carrier is incompatible with the active ingredient, its use in the therapeutic or pharmaceutical compositions is contemplated.

In any case, the composition may comprise various antioxidants to retard oxidation of one or more component. Additionally, the prevention of the action of microorganisms can be brought about by preservatives such as various antibacterial and antifungal agents, including but not limited to parabens (e.g., methylparabens, propylparabens), chlorobutanol, phenol, sorbic acid, thimerosal or combinations thereof.

The folate binding protein variant may be formulated into a composition in a free base, neutral or salt form. Pharmaceutically acceptable salts, include the acid addition salts, e.g., those formed with the free amino groups of a proteinaceous composition, or which are formed with inorganic acids such as for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric or mandelic acid. Salts formed with the free carboxyl groups can also be derived from inorganic bases such as for example, sodium, potassium, ammonium, calcium or ferric hydroxides; or such organic bases as isopropylamine, trimethylamine, histidine or procaine.

In embodiments where the composition is in a liquid form, a carrier can be a solvent or dispersion medium comprising but not limited to, water, ethanol, polyol (e.g., glycerol, propylene glycol, liquid polyethylene glycol, etc.), lipids (e.g., triglycerides, vegetable oils, liposomes) and combinations thereof. The proper fluidity can be maintained, for example, by the use of a coating, such as lecithin; by the maintenance of the required particle size by dispersion in carriers such as, for example liquid polyol or lipids; by the use of surfactants such as, for example hydroxypropylcellulose; or combinations thereof such methods. In many cases, it will be preferable to include isotonic agents, such as, for example, sugars, sodium chloride or combinations thereof.

In other embodiments, one may use nasal solutions or sprays, aerosols or inhalants in the present invention. Such compositions are generally designed to be compatible with the target tissue type. In a non-limiting example, nasal solutions are usually aqueous solutions designed to be administered to the nasal passages in drops or sprays. Nasal solutions are prepared so that they are similar in many respects to nasal secretions, so that normal ciliary action is maintained. Thus, in preferred embodiments the aqueous nasal solutions usually are isotonic or slightly buffered to maintain a pH of about 5.5 to about 6.5. In addition, antimicrobial preservatives, similar to those used in ophthalmic preparations, drugs, or appropriate drug stabilizers, if required, may be included in the formulation. For example, various commercial nasal preparations are known and include drugs such as antibiotics or antihistamines.

In certain embodiments the folate binding protein variant is prepared for administration by such routes as oral ingestion. In these embodiments, the solid composition may comprise, for example, solutions, suspensions, emulsions, tablets, pills, capsules (e.g., hard or soft shelled gelatin capsules), sustained release formulations, buccal compositions, troches, elixirs, suspensions, syrups, wafers, or combinations thereof. Oral compositions may be incorporated directly with the food of the diet. Preferred carriers for oral administration comprise inert diluents, assimilable edible carriers or combinations thereof. In other aspects of the invention, the oral composition may be prepared as a syrup or elixir. A syrup or elixir, and may comprise, for example, at least one active agent, a sweetening agent, a preservative, a flavoring agent, a dye, a preservative, or combinations thereof.

In certain preferred embodiments an oral composition may comprise one or more binders, excipients, disintegration agents, lubricants, flavoring agents, and combinations thereof. In certain embodiments, a composition may comprise one or more of the following: a binder, such as, for example, gum tragacanth, acacia, cornstarch, gelatin or combinations thereof; an excipient, such as, for example, dicalcium phosphate, mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate or combinations thereof; a disintegrating agent, such as, for example, corn starch, potato starch, alginic acid or combinations thereof; a lubricant, such as, for example, magnesium stearate; a sweetening agent, such as, for example, sucrose, lactose, saccharin or combinations thereof; a flavoring agent, such as, for example peppermint, oil of wintergreen, cherry flavoring, orange flavoring, etc.; or combinations thereof the foregoing. When the dosage unit form is a capsule, it may contain, in addition to materials of the above type, carriers such as a liquid carrier. Various other materials may be present as coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar or both.

Additional formulations which are suitable for other modes of administration include suppositories. Suppositories are solid dosage forms of various weights and shapes, usually medicated, for insertion into the rectum, vagina or urethra. After insertion, suppositories soften, melt or dissolve in the cavity fluids. In general, for suppositories, traditional carriers may include, for example, polyalkylene glycols, triglycerides or combinations thereof. In certain embodiments, suppositories may be formed from mixtures containing, for example, the active ingredient in the range of about 0.5% to about 10%, and preferably about 1% to about 2%.

Sterile injectable solutions are prepared by incorporating the active compounds in the required amount in the appropriate solvent with various of the other ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion medium and/or the other ingredients. In the case of sterile powders for the preparation of sterile injectable solutions, suspensions or emulsion, the preferred methods of preparation are vacuum-drying or freeze-drying techniques which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered liquid medium thereof. The liquid medium should be suitably buffered if necessary and the liquid diluent first rendered isotonic prior to injection with sufficient saline or glucose. The preparation of highly concentrated compositions for direct injection is also contemplated, where the use of DMSO as solvent is envisioned to result in extremely rapid penetration, delivering high concentrations of the active agents to a small area.

The composition must be stable under the conditions of manufacture and storage, and preserved against the contaminating action of microorganisms, such as bacteria and fungi. It will be appreciated that endotoxin contamination should be kept minimally at a safe level, for example, less that 0.5 ng/mg protein.

In particular embodiments, prolonged absorption of an injectable composition can be brought about by the use in the compositions of agents delaying absorption, such as, for example, aluminum monostearate, gelatin or combinations thereof.

J. Vaccine Administration

The manner of administration of a vaccine may be varied widely. Any of the conventional methods for administration of a vaccine are applicable. For example, a vaccine may be conventionally administered intravenously, intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostaticaly, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intratumorally, intramuscularly, intraperitoneally, subcutaneously, intravesicularlly, mucosally, intrapericardially, orally, rectally, nasally, topically, in eye drops, locally, using aerosol, injection, infusion, continuous infusion, localized perfusion bathing target cells directly, via a catheter, via a lavage, in cremes, in lipid compositions (e.g., liposomes), or by other method or any combination of the forgoing as would be known to one of ordinary skill in the art (see, for example, Remington's Pharmaceutical Sciences, 18th Ed. Mack Printing Company, 1990, incorporated herein by reference).

A vaccination schedule and dosages may be varied on a patient by patient basis, taking into account, for example, factors such as the weight and age of the patient, the type of disease being treated, the severity of the disease condition, previous or concurrent therapeutic interventions, the manner of administration and the like, which can be readily determined by one of ordinary skill in the art.

A vaccine is administered in a manner compatible with the dosage formulation, and in such amount as will be therapeutically effective and immunogenic. For example, the intramuscular route may be preferred in the case of toxins with short half lives in vivo. The quantity to be administered depends on the subject to be treated, including, e.g., the capacity of the individual's immune system to synthesize antibodies, and the degree of protection desired. The dosage of the vaccine will depend on the route of administration and will vary according to the size of the host. Precise amounts of an active ingredient required to be administered depend on the judgment of the practitioner. In certain embodiments, pharmaceutical compositions may comprise, for example, at least about 0.1% of an active compound. In other embodiments, the an active compound may comprise between about 2% to about 75% of the weight of the unit, or between about 25% to about 60%, for example, and any range derivable therein However, a suitable dosage range may be, for example, of the order of several hundred micrograms active ingredient per vaccination. In other non-limiting examples, a dose may also comprise from about 1 microgram/kg/body weight, about 5 microgram/kg/body weight, about 10 microgram/kg/body weight, about 50 microgram/kg/body weight, about 100 microgram/kg/body weight, about 200 microgram/kg/body weight, about 350 microgram/kg/body weight, about 500 microgram/kg/body weight, about 1 milligram/kg/body weight, about 5 milligram/kg/body weight, about 10 milligram/kg/body weight, about 50 milligram/kg/body weight, about 100 milligram/kg/body weight, about 200 milligram/kg/body weight, about 350 milligram/kg/body weight, about 500 milligram/kg/body weight, to about 1000 mg/kg/body weight or more per vaccination, and any range derivable therein. In non-limiting examples of a derivable range from the numbers listed herein, a range of about 5 mg/kg/body weight to about 100 mg/kg/body weight, about 5 microgram/kg/body weight to about 500 milligram/kg/body weight, etc., can be administered, based on the numbers described above. A suitable regime for initial administration and booster administrations (e.g., innoculations) are also variable, but are typified by an initial administration followed by subsequent inoculation(s) or other administration(s).

In many instances, it will be desirable to have multiple administrations of the vaccine, usually not exceeding six vaccinations, more usually not exceeding four vaccinations and preferably one or more, usually at least about three vaccinations. The vaccinations will normally be at from two to twelve week intervals, more usually from three to five week intervals. Periodic boosters at intervals of 1-5 years, usually three years, will be desirable to maintain protective levels of the antibodies.

The course of the immunization may be followed by assays for antibodies for the supernatant antigens. The assays may be performed by labeling with conventional labels, such as radionuclides, enzymes, fluorescents, and the like. These techniques are well known and may be found in a wide variety of patents, such as U.S. Pat. Nos. 3,791,932; 4,174,384 and 3,949,064, as illustrative of these types of assays. Other immune assays can be performed and assays of protection from challenge with the folate binding protein variant can be performed, following immunization.

K. Enhancement of an Immune Response

The present invention includes a method of enhancing the immune response in a subject comprising the steps of contacting one or more lymphocytes with a folate binding protein variant antigenic composition, wherein the antigen comprises as part of its sequence a sequence in accordance with SEQ ID NO:1 through SEQ ID NO:8, or a immunologically functional equivalent thereof. In certain embodiments the one or more lymphocytes is comprised in an animal, such as a human. In other embodiments, the lymphocyte(s) may be isolated from an animal or from a tissue (e.g., blood) of the animal. In certain preferred embodiments, the lymphocyte(s) are peripheral blood lymphocyte(s). In certain embodiments, the one or more lymphocytes comprise a T-lymphocyte or a B-lymphocyte. In a particularly preferred facet, the T-lymphocyte is a cytotoxic T-lymphocyte.

The enhanced immune response may be an active or a passive immune response. Alternatively, the response may be part of an adoptive immunotherapy approach in which lymphocyte(s) are obtained with from an animal (e.g., a patient), then pulsed with composition comprising an antigenic composition. In a preferred embodiment, the lymphocyte(s) may be administered to the same or different animal (e.g., same or different donors).

1. Cytotoxic T Lymphocytes

In certain embodiments, T-lymphocytes are specifically activated by contact with an antigenic composition of the present invention. In certain embodiments, T-lymphocytes are activated by contact with an antigen presenting cell that is or has been in contact with an antigenic composition of the invention.

T cells express a unique antigen binding receptor on their membrane (T-cell receptor), which can only recognize antigen in association with major histocompatibility complex (MHC) molecules on the surface of other cells. There are several populations of T cells, such as T helper cells and T cytotoxic cells. T helper cells and T cytotoxic cells are primarily distinguished by their display of the membrane bound glycoproteins CD4 and CD8, respectively. T helper cells secret various lymphokines, that are crucial for the activation of B cells, T cytotoxic cells, macrophages and other cells of the immune system. In contrast, a T cytotoxic cell that recognizes an antigen-MHC complex proliferates and differentiates into an effector cell called a cytotoxic T lymphocyte (CTL). CTLs eliminate cells of the body displaying antigen by producing substances that result in cell lysis.

CTL activity can be assessed by methods described herein or as would be known to one of skill in the art. For example, CTLs may be assessed in freshly isolated peripheral blood mononuclear cells (PBMC), in a phytohaemaglutinin-stimulated IL-2 expanded cell line established from PBMC (Bernard et al., 1998) or by T cells isolated from a previously immunized subject and restimulated for 6 days with DC infected with an adenovirus vector containing antigen using standard 4 h 51^(Cr) release microtoxicity assays. In another fluorometric assay developed for detecting cell-mediated cytotoxicity, the fluorophore used is the non-toxic molecule ALAMARBLUE (dye) (Nociari et al., 1998). The ALAMARBLUE (dye) is fluorescently quenched (i.e., low quantum yield) until mitochondrial reduction occurs, which then results in a dramatic increase in the alamarBlue fluorescence intensity (i.e., increase in the quantum yield). This assay is reported to be extremely sensitive, specific and requires a significantly lower number of effector cells than the standard ⁵¹Cr release assay.

In certain aspects, T helper cell responses can be measured by in vitro or in vivo assay with peptides, polypeptides or proteins. In vitro assays include measurement of a specific cytokine release by enzyme, radioisotope, chromaphore or fluorescent assays. In vivo assays include delayed type hypersensitivity responses called skin tests, as would be known to one of ordinary skill in the art.

2. Antigen Presenting Cells

In general, the term “antigen presenting cell” can be any cell that accomplishes the goal of the invention by aiding the enhancement of an immune response (i.e., from the T-cell or -B-cell arms of the immune system) against an antigen (e.g., a folate binding protein variant or a immunologically functional equivalent) or antigenic composition of the present invention. Such cells can be defined by those of skill in the art, using methods disclosed herein and in the art. As is understood by one of ordinary skill in the art (see for example Kuby, 1993, incorporated herein by reference), and used herein certain embodiments, a cell that displays or presents an antigen normally or preferentially with a class II major histocompatability molecule or complex to an immune cell is an “antigen presenting cell.” In certain aspects, a cell (e.g., an APC cell) may be fused with another cell, such as a recombinant cell or a tumor cell that expresses the desired antigen. Methods for preparing a fusion of two or more cells is well known in the art, such as for example, the methods disclosed in Goding, pp. 65-66, 71-74 1986; Campbell, pp. 75-83, 1984; Kohler and Milstein, 1975; Kohler and Milstein, 1976, Gefter et al., 1977, each incorporated herein by reference. In some cases, the immune cell to which an antigen presenting cell displays or presents an antigen to is a CD4⁺ TH cell. Additional molecules expressed on the APC or other immune cells may aid or improve the enhancement of an immune response. Secreted or soluble molecules, such as for example, immunomodulators and adjuvants, may also aid or enhance the immune response against an antigen. Such molecules are well known to one of skill in the art, and various examples are described herein.

VII. Peptide Formulations

Peptides containing the epitope motifs described herein are contemplated for use in therapeutics to provide universal FBP targets and antigens for CTLs in the HLA-A2 system. The development of therapeutics based on these novel sequences provides induction of tumor reactive immune cells in vivo through the formulation of synthetic cancer vaccines, as well as induction of tumor-reactive T-cells in vitro through either peptide-mediated (e.g., lipopeptide) or cell-mediated (e.g., EBV-B lines using either autologous or HLA-A2 transfectants where the gene for the peptide of interest is introduced, and the peptide is expressed associated with HLA-A2 on the surface). The use of these novel peptides as components of vaccines to prevent, or lessen the chance of cancer progression is also contemplated.

The peptides contemplated for use, being smaller than other compositions, such as envelope proteins, will have improved bioavailability and half lives. If desired, stability examinations may be performed on the peptides, including, e.g., pre-incubation in human serum and plasma; treatment with various proteases; and also temperature- and pH-stability analyses. If found to be necessary, the stability of the synthetic peptides may be enhanced by any one of a variety of methods such as, for example, employing D-amino acids in place of L-amino acids for peptide synthesis; using blocking groups like t-boc and the like; or encapsulating the peptides within liposomes. The bio-availability of select mixtures of peptides may also be determined by injecting radio-labeled peptides into experimental animals, such as mice and/or Rhesus monkeys, and subsequently analyzing their tissue distribution.

If stability enhancement was desired, it is contemplated that the use of dextrorotary amino acids (D-amino acids) would be advantageous as this would result in even longer bioavailability due to the inability of proteases to attack these types of structures. The peptides of the present invention may also be further stabilized, for example, by the addition of groups to the N- or C-termini, such as by acylation or amination. If desired, the peptides could even be in the form of lipid-tailed peptides, formulated into surfactant-like micelles, or other peptide multimers. The preparation of peptide multimers and surfactant-like micelles is described in detail in U.S. Ser. No. 07/945,865, incorporated herein by reference. The compositions of the present invention are contemplated to be particularly advantageous for use in economical and safe anti-tumor/anti-cancer therapeutics, and specific therapeutic formulations may be tested in experimental animal models, such as mice, rats, rabbits, guinea pigs, cats, goats, Rhesus monkeys, chimpanzees, and the like, in order to determine more precisely the dosage forms required.

In addition to the peptidyl compounds described herein, the inventors also contemplate that other sterically similar compounds may be formulated to mimic the key portions of the peptide structure and that such compounds may also be used in the same manner as the peptides of the invention. This may be achieved by the techniques of modelling and chemical design known to those of skill in the art. For example, esterification and other alkylations may be employed to modify the terminus of a peptide to mimic a particular terminal motif structure. It will be understood that all such sterically similar constructs fall within the scope of the present invention.

Therapeutic or pharmacological compositions of the present invention will generally comprise an effective amount of a CTL-stimulating peptide or peptides, dissolved or dispersed in a pharmaceutically acceptable medium. The phrase “pharmaceutically acceptable” refers to molecular entities and compositions that do not produce an allergic, toxic, or otherwise adverse reaction when administered to a human. Pharmaceutically acceptable media or carriers include any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredient, its use in the therapeutic compositions is contemplated.

Supplementary active ingredients can also be incorporated into the therapeutic compositions of the present invention. For example, the stimulatory peptides may also be combined with peptides including cytotoxic T-cell- or T-helper-cell-inducing epitopes (as disclosed in U.S. Ser. No. 07/945,865; incorporated herein by reference) to create peptide cocktails for immunization and treatment.

The preparation of pharmaceutical or pharmacological compositions containing a CTL-stimulating peptide or peptides, including dextrorotatory peptides, as active ingredients will be known to those of skill in the art in light of the present disclosure. Typically, such compositions may be prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection; as tablets or other solids for oral administration; as time release capsules; or in any other form currently used, including cremes, lotions, mouthwashes, inhalents and the like.

Solutions of the active compounds as free base or pharmacologically acceptable salts can be prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms.

Sterile solutions suitable for intravenous administration are preferred in certain embodiments and are contemplated to be particularly effective in stimulating CTLs and/or producing an immune response in an animal. The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. In all cases the form must be sterile and must be fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms, such as bacteria and fungi.

A peptide or peptides can be formulated into a composition in a neutral or salt form. Pharmaceutically acceptable salts, include the acid addition salts (formed with the free amino groups of the peptide) and which are formed with inorganic acids such as, e.g., hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, procaine, and the like.

The carrier can also be a solvent or dispersion medium containing, e.g., water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained by inter alia the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be brought inter alia by various antibacterial ad antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, e.g., sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions are prepared by incorporating the active compounds in the required amount in the appropriate solvent with various of the other ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.

The preparation of more- or highly-concentrated solutions for intramuscular injection is also contemplated. This is envisioned to have particular utility in facilitating the treatment of needle stick injuries to animals or even humans. In this regard, the use of DMSO as solvent is preferred as this will result in extremely rapid penetration, delivering high concentrations of the active peptide, peptides or agents to a small area.

The use of sterile formulations, such as saline-based washes, by veterinarians, technicians, surgeons, physicians or health care workers to cleanse a particular area in the operating field may also be particularly useful. Therapeutic formulations in accordance with the present invention may also be reconstituted in the form of mouthwashes, including the peptides alone, or in conjunction with antifungal reagents. Inhalant forms are also envisioned, which again, may contain active peptides or agents alone, or in conjunction with other agents, such as, e.g., pentamidine. The therapeutic formulations of the invention may also be prepared in forms suitable for topical administration, such as in cremes and lotions.

Suitable preservatives for use in such a solution include benzalkonium chloride, benzethonium chloride, chlorobutanol, thimerosal and the like. Suitable buffers include boric acid, sodium and potassium bicarbonate, sodium and potassium borates, sodium and potassium carbonate, sodium acetate, sodium biphosphate and the like, in amounts sufficient to maintain the pH at between about pH 6 and pH 8, and preferably, between about pH 7 and pH 7.5. Suitable tonicity agents are dextran 40, dextran 70, dextrose, glycerin, potassium chloride, propylene glycol, sodium chloride, and the like, such that the sodium chloride equivalent of the ophthalmic solution is in the range 0.9±0.2%. Suitable antioxidants and stabilizers include sodium bisulfite, sodium metabisulfite, sodium thiosulfate, thiourea and the like. Suitable wetting and clarifying agents include polysorbate 80, polysorbate 20, poloxamer 282 and tyloxapol. Suitable viscosity-increasing agents include dextran 40, dextran 70, gelatin, glycerin, hydroxyethylcellulose, hydroxmethyl-propylcellulose, lanolin, methylcellulose, petrolatum, polyethylene glycol, polyvinyl alcohol, polyvinylpyrrolidone, carboxymethylcellulose and the like.

Upon formulation, therapeutics will be administered in a manner compatible with the dosage formulation, and in such amount as is pharmacologically effective. The formulations are easily administered in a variety of dosage forms, such as the type of injectable solutions described above, but drug release capsules and the like can also be employed. As used herein, “pharmacologically effective amount” means an amount of composition is used that contains an amount of a peptide or peptides sufficient to significantly stimulate a CTL or generate an immune response in an animal.

In this context, the quantity of peptide(s) and volume of composition to be administered depends on the host animal to be treated, such as, the capacity of the host animal's immune system to produce an immune response. Precise amounts of active peptide required to be administered depend on the judgment of the practitioner and are peculiar to each individual.

A minimal volume of a composition required to disperse the peptide is typically utilized. Suitable regimes for administration are also variable, but would be typified by initially administering the compound and monitoring the results and then giving further controlled doses at further intervals. For example, for parenteral administration, a suitably buffered, and if necessary, isotonic aqueous solution would be prepared and used for intravenous, intramuscular, subcutaneous or even intraperitoneal administration. One dosage could be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, “Remington's Pharmaceutical Sciences” 15th Edition, pages 1035-1038 and 1570-1580).

In certain embodiments, active compounds may be administered orally. This is contemplated for agents that are generally resistant, or have been rendered resistant, to proteolysis by digestive enzymes. Such compounds are contemplated to include chemically designed or modified agents; dextrorotatory peptides; and peptide and liposomal formulations in timed-release capsules to avoid peptidase, protease and/or lipase degradation.

Oral formulations may include compounds in combination with an inert diluent or an edible carrier which may be assimilated; those enclosed in hard- or soft-shell gelatin capsules; those compressed into tablets; or those incorporated directly with the food of the diet. For oral therapeutic administration, the active compounds may be incorporated with excipients and used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and preparations should generally contain at least 0.1% of active compound. The percentage of the compositions and preparations may, of course, be varied and may conveniently be between about 2 to about 60% of the weight of the unit. The amount of active compounds in such therapeutically useful compositions is such that a suitable dosage will be obtained.

Tablets, troches, pills, capsules and the like may also contain the following: a binder, as gum tragacanth, acacia, corn starch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid and the like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain, in addition to materials of the above type, a liquid carrier. Various other materials may be present as coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup of elixir may contain the active compounds sucrose as a sweetening agent methyl and propylparaben as preservatives, a dye and flavoring, such as cherry or orange flavor. Of course, any material used in preparing any dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts employed. In addition, the active compounds may be incorporated into sustained-release preparation and formulations.

The peptides may be used in their immunizing capacity by administering an amount effective to generate an immune response in an animal. In this sense, such an “amount effective to generate an immune response” means an amount of composition that contains a peptide or peptide mixture sufficient to significantly produce an antigenic response in the animal.

VIII. Examples

The following examples are included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventors to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the invention.

Example 1 Rationale for Variant Design

Studies in experimental models regarding lymphocyte development in the thymus show that interaction of thymocytes with weak or null (no apparent effect) agonists lead to positive selection (i.e. survival) of responders for a specific Ag, while stimulation with strong agonists leads to negative selection (deletion of reactive CTL). Similarly, recent studies on CD8⁺ cell responses from peripheral blood show that Ag variants with null or weak agonistic activity induced expansion of precursors of CTL responding to a model Ag, but not effector function. These results were obtained with transgenic animals, and the recipients for the CTL were heavily irradiated. There is little information concerning how the responders to tumor, and/or their precursors, can be maintained and avoid elimination in healthy individuals, or patients without evidence of disease. However, the presence of such precursors, or of activated CTL recognizing tumor Ag, (Peoples et al., 1998; Hudson et al., 1998; Peoples et al, 1998; Kim et al., 1999; Lee et al., 2000) is proof that such responders exist in the peripheral blood. Approaches to promote their survival, expansion and induction of lytic formation is beneficial for the patients. If the responders targeted for survival are low-affinity CTL, the weak affinity is expected to be compensated by a significant increase in effector numbers. If the responders are of high affinity, protection from AICD will also allow their expansion.

To design “survival inducing” Ag, the present invention focuses on the FBP epitope E39: EIWTHSYKV (SEQ ID NO:268). This epitope is recognized, although with low affinity, by ovarian and breast tumor reactive CTL. It was predicted that improved immunogenicity in terms of net gain in cell numbers reacting with the wild-type Ag is achieved by reducing the positive charge at the amino acid in position 5 (histidine) and replacement of histidine with phenylalanine (Phe). Phe is not charged, but its benzene aromatic ring is a close substitution for the imidazole ring of histidine. To ensure a better flexibility of the residues in the peptide, the phenolic structure of tyrosine was replaced with the aliphatic core chain of Threonine (Thr). Both Tyr and Thr contain an OH (hydroxyl) side chain group. Thus, the positive charge in position 5 and the rigid structure of Tyr were eliminated. In a specific embodiment, this increases the flexibility of the residues 5-9 (SYKV) (SEQ ID NO:270) in the peptide and allows for a better fitting of the TCR with the peptide MHC complex. The variant: E I W T F S T K V (SEQ ID NO:5) was designated J65. Additional variants of J65 were created with changes in position 7 (Tyr)→Thr only=designated J77, in position 5 only Phe→His=designated J78, and in positions 1 and 6. These analogs/variants are listed in Table 5.

TABLE 5 Variants of Folate Binding Protein VARIANT SEQUENCE CHANGE E39 EIWTHSYKV (SEQ wild type ID NO: 268) J77 EIWTHSTKV (SEQ Y7→T ID NO: 1) J78 EIWTFSYKV (SEQ ID H5→F NO: 2) J68 FIWTFATKV (SEQ ID E1→F, H5→F,  NO: 3) S6→A; Y7→T J67 EIWTHATKV (SEQ S6→A, Y7→T ID NO: 4) J66 FIWTFSTKV (SEQ ID E1→F, H5→F, Y7→T NO: 271) J65 EIWTFSTKV (SEQ ID H5→F, Y7→T NO: 5) J64 GIWTHSTKV (SEQ E1→G, Y7→T ID NO: 7) J63 FIWTHSTKV (SEQ ID E1→F, Y7→T NO: 8)

Selection of these Ag variants was made on the principle of Ag alteration aiming to alternate signaling. In addition to substitutions H→F (Pos. 5) and Y→T (pos. 7), substitutions were introduced in the other positions: S→A (Pos. 6 and Glu (B)→F and E→Gly (G) (in Pos. 1). The purpose of these substitutions was to remove potential reacting groups with the TCR. In the substitution S→A (Pos. A), this change removes a side chain OH group. In position 1, the substitution E (glutamic acid)→glycine, removes the entire aliphatic side chain plus the charged COO group. Also in position 1, the substitution E→F (removes the charged group COO, but introduces an aromatic ring). These substitutions aim to diminish the reactivity of the peptide with the TCR.

Example 2 IFN-γ Induction and CTL Activity

The HLA-A2 stabilizing ability of the variant peptides has also been determined (FIG. 1). The results show that the stabilizing ability of J65 is almost half of the stabilizing ability of E39. In contrast, substitutions at position 1 increase the binding affinity of the peptide. The results in FIG. 2 show the cytolytic activity of J65-induced CTL compared with E39-induced CTL. The results indicate that J65 was a weaker inducer of IFN-γ from 3×J65 stimulated cultures than J77 and E39, suggesting that the changes in the sequence had cumulative effects in decreasing IFN-γ induction.

To address the effects of FBP variants on induction of CTL activity, PBMC cultures from the healthy donor stimulated three times with J65 were split in three and restimulated with either E39 or J65 or J77. A control culture was made of the same PBMC stimulated three times with E39 and restimulated with E39 for the fourth time. PBMC stimulated three times with E39 (3×E39) followed by E39 showed moderate weak recognition of E39. In contrast, 3×J65 stimulated CTL showed significantly higher recognition of E39 after stimulation with E39. A similar picture was observed with 3×J65 cells restimulated with J65, while 3×J65 restimulated with J77 showed significantly lower CTL activity than 3×J65 stimulated with the other peptides. It was recently reported that memory CTL reacting with the tumor Ag such as FBP are present in the blood of healthy individuals (Lee et al., 2000). These cells can be easily activated by stimulation with the corresponding peptide presented on dendritic cells (Kim et al., 1999). To evaluate the stimulating ability of the analogs J65 and J77, PBMC from a responding donor were stimulated with E39, J65 and J77. These results show that the potentiating role of J65 in responder proliferation and cytotoxicity does not reflect enhanced IL-2 and/or IFN-γ secretion compared with the wild-type Ag, but its weaker cytokine-inducing activity appears to protect CTL of higher affinity from apoptosis by avoiding overstimulation.

Example 3 Specific IL-2 Induction by Priming with FBP Variants

In J65-primed CTL, higher CTL activity and IFN-γ secretion can be elicited by the wild-type epitope E39, suggesting a protective effect of the previous stimulations. The results in FIG. 3 show that J65 and J77 induced lower levels of IL-2 in the PBMC of this donor compared with the wild-type peptide E39. To identify which of E39 variants induced higher cell expansion, PBMC from the same donor were stimulated three times with the corresponding peptide, and the resulting live cells were counted a week after each stimulation. The results in FIG. 4 show that cultures stimulated with E39 initially expanded faster than other cultures; however, after the third stimulation, cultures stimulated with J65 increased faster in numbers. In contrast, cultures stimulated with J78 (H→F) and J77 (Y→T) proliferated slower than control cultures which were not stimulated with peptide. Similar results were obtained with J65 in another donor (FIG. 5). In this donor, cells stimulated with E39 died after the third stimulation while cells stimulated by J65 expanded faster. Cells stimulated with J77 and J78 also expanded, but at a slower rate.

TABLE 6 Accessions/SEQ ID NOs Sequence P14207 M V W K W M P L L L L L V C V A T M C S A Q D R T D L L N V C M SEQ ID NO: 9 D A K H H K T K P G P E D K L H D Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N F N W D H C G K M E P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V N Q T W R K E R F L D V P L C K E D C Q R W W E D C H T S H T C K S N W H R G W D W T S G V N K C P A G A L C R T F E S Y F P T P A A L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S A Q G N P N E E V A R F Y A A A M H V N A G E M L H G T G G L L L S L A L M L Q L W L L G P15328 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 10  L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S P13255 M V D S V Y R T R S L G V A A E G I P D Q Y A D G E A A R V W Q SEQ ID NO: 11  L Y I G D T R S R T A E Y K A W L L G L L R Q H G C H R V L D V A C G T G V D S I M L V E E G F S V T S V D A S D K M L K Y A L K E R W N R R K E P A F D K W V I E E A N W L T L D K D V P A G D G F D A V I C L G N S F A H L P D S K G D Q S E H R L A L K N I A S M V R P G G L L V I D H R N Y D Y I L S T G C A P P G K N I Y Y K S D L T K D I T T S V L T V N N K A H M V T L D Y T V Q V P G A G R D G A P G F S K F R L S Y Y P H C L A S F T E L V Q E A F G G R C Q H S V L G D F K P Y R P G Q A Y V P C Y F I H V L K K T G NP_000793 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 12 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S AAB05827 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 13 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S AAG36877 M A Q W W Q I L L G L W A V L P T L A G D K L L S V C M N S K R SEQ ID NO: 14 H K Q E P G P E D E L Y Q E C R P W E D N A C C T R S T S W E A H L E E P L L F N F S M M H C G L L T P A C R K H F I Q A I C F H E C S P N L G P W I Q P V V P N G Q E E Q R V W G V P L C Q E D C E D W W R A C H S S L T C K S N W L H G W D W S E E K K H C P A H E P C L P F S Y H F P T P D D L C E K I W N N T F K A S P E R R N S G R C L Q K W F E P T L S N P N V E V A L H F A G S A L A P Q L S Y T L P A F S L C L L F H P S42627 M V D S V Y R T R S L G V A A E G L P D Q Y A E G E A A R V W Q SEQ ID NO: 15 L Y I G D T R S R T A E Y K A W L L G L L R Q H G C Q R V L D V A C G T G V D S I M L V E E G F S V T S V D A S D K M L K Y A L K E R W N R R H E P A F D K W V I E E A N W M T L D K D V P Q S A E G G F D A V I C L G N S F A H L P D C K G D Q S E H R L A L K N I A S M V R A G G L L V I D H R N Y D H I L S T G C A P P G K N I Y Y K S D L T K D V T T S V L I V N N K A H M V T L D Y T V Q V P G A G Q D G S P G L S K F R L S Y Y P H C L A S F T E L L Q A A F G G K C Q H S V L G D F K P Y K P G Q T Y I P C Y F I H V L K R T D S00112 M V D S V Y R T R S L G V A A E G I P D Q Y A D G E A A R V W Q SEQ ID NO: 16 L Y I G D T R S R T A E Y K A W L L G L L R Q H G C H R V L D V A C G T G V D S I M L V E E G F S V T S V D A S D K M L K Y A L K E R W N R R K E P A F D K W V I E E A N W L T L D K D V P A G D G F D A V I C L G N S F A H L P D S K G D Q S E H R L A L K N I A S M V R P G G L L V I D H R N Y D Y I L S T G C A P P G K N I Y Y K S D L T K D I T T S V L T V N N K A H M V T L D Y T V Q V P G A G R D G A P G F S K F R L S Y Y P H C L A S F T E L V Q E A F G G R C Q H S V L G D F K P Y R P G Q A Y V P C Y F I H V L K K T G BFBO A Q A P R T P R A R T D L L N V C M D A K H H K A E P G P E D S SEQ ID NO: 17 L H E Q C S P W R K N A C C S V N T S I E A X K D I S Y L Y R F N W D H C G K M E P A C K R H F I Q D T C L Y E C S P N L G P W I R E V N Q R W R K E R V L G V P L C K E D C Q S W W E D C R T S Y T C K S N W H K G W N W T S G Y N Q C P V K A A H C R F D F Y F P T P A A L C N E I W S H S Y K V S N Y S R G S G R C I Q M W F D P F Q G N P N E E V A R F Y A E N P T S G S T P Q G I S62670 Q A T R A R T E L L N V F A D A K R E K P K SEQ ID NO: 18 S62669 Q A T R A E T E N L N V D M D A K H H K E K SEQ ID NO: 19 A55968 M V P S S P A V E K Q V P V E P G P D P E L R S W R H L V C Y L SEQ ID NO: 20 C F Y G F M A Q I R P G E S F I T P Y L L G P D K N F T R E Q V T N E I T P V L S Y S Y L A V L V P V F L L T D Y L R Y T P V L L L Q G L S F V S V W L L L L L G H S V A H M Q L M E L F Y S V T M A A R I A Y S S Y I F S L V R P A R Y Q R V A G Y S R A A V L L G V F T S S V L G Q L L V T V G R V S F S T L N Y I S L A F L T F S V V L A L F L K R P K R S L F F N R D D R G R C E T S A S E L E R M N P G P G G K L G H A L R V A C G D S V L A R M L R E L G D S L R R P Q L R L W S L W W V F N S A G Y Y L V V Y Y V H I L W N E V D P T T N S A R V Y N G A A D A A S T L L G A I T S F A A G F V K I R W A R W S K L L I A G V T A T Q A G L V F L L A H T R H P S S I W L C Y A A F V L F R G S Y Q F L V P I A T F Q I A S S L S K E L C A L V F G V N T F F A T I V K T I I T F I V S D V R G L G L P V R K P V I L R V L P D P V H H L L L G G H G A W P A A L P A G P P P A A A P G P G P E E C R G G E G S T G T E R A G Q G P R R L Q P A Q S P P L S P E D S L G A V G P A S L E Q R Q S D P Y L A Q A P A P Q A A E F L S P V T T P S P C T L S S A Q A S G P E A A D E T C P Q L A V H P P G V S K L G L Q C L P S D G V Q N V N Q A45753 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 21 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S A33417 M V W K W M P L L L L L V C V A T M C S A Q D R T D L L N V C M SEQ ID NO: 22  D A K H H K T K P G P E D K L H D Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N F N W D H C G K M E P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V N Q T W R K E R F L D V P L C K E D C Q R W W E D C H T S H T C K S N W H R G W D W T S G V N K C P A G A L C R T F E S Y F P T P A A L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S A Q G N P N E E V A R F Y A A A M H V N A G E M L H G T G G L L L S L A L M L Q L W L L G B40969 M A W K Q T P L L L L V Y M V T T G S G R D R T D L L N V C M D SEQ ID NO: 23  A K H H K T K P G P E D K L H D Q C S P W K K N A C C S V N T S Q E L H K A D S R L Y F N W D H C G K M E P A C K S H F I Q D S C L Y E C S P N L G P W I Q Q V D Q S W R K E R F L D V P L C K E D C H Q W W E A C R T S F T C K R D W H K G W D W S S G I N K C P N T A P C H T F E Y Y F P T P A S L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S T Q G N P N E D V V K F Y A S F M T S G T V P H A A V L L V P S L A P V L S L W L P G A40969 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A R T E L L  SEQ ID NO: 24  N V C M D A K H H K E K P G P E D N L H D Q C S P W K T N S C C S T N T S Q E A H K D I S Y L Y R F N W N H C G T M T S E C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R I L D V P L C K E D C Q Q W W E D C Q S S F T C K S N W H K G W N W S S G H N E C P V G A S C H P F T F Y F P T S A A L C E E I W S H S Y K L S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A E A M S G A G L H G T W P L L C S L S L V L L W V I S NP_057943 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 25  L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L NP_057942 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 26 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C   K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S NP_057941 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 27 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C   K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L NP_057937 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 28 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C   K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S NP_057936 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 29  L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S NP_037439 M A S V P K T N K I E P R S Y S I I P S C S I R R L G P A L N T SEQ ID NO: 30  P I F Q S K R N G P R G H S A Y S I E G R Q R Q G A G R A V V P R A D R P P A P K I Q L R A F Y L Q Q L Y Y T L L E L E L P R L L A P D L P S N G S S L K D L K W T H S N Y R A S K E S C I V I F V T T S P G R E W V I C A P A A F L G C G S L Q A P S P E S E P S F P V T R G H H G R H G D Y H R K L I G Q T F E W V V V R R H G G R A I G P R L S R V T K A A G A R P P A G A G E G L R V G F D L I N A P I P P A K G V S A R R H V L A L E L P Q L S K NP_032061 M A W K Q T P L L L L V Y M V T T G S G R D R T D L L N V C M D SEQ ID NO: 31 A K H H K T K P G P E D K L H D Q C S P W K K N A C C S V N T S Q E L H K A D S R L Y F N W D H C G K M E P A C K S H F I Q D S C L Y E C S P N L G P W I Q Q V D Q S W R K E R F L D V P L C K E D C H Q W W E A C R T S F T C K R D W H K G W D W S S G I N K C P N T A P C H T F E Y Y F P T P A S L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S T Q G N P N E D V V K F Y A S F M T S G T V P H A A V L L V P S L A P V L S L W L P G NP_032060 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A R T E L L SEQ ID NO: 32 N V C M D A K H H K E K P G P E D N L H D Q C S P W K T N S C C S T N T S Q E A H K D I S Y L Y R F N W N H C G T M T S E C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R I L D V P L C K E D C Q Q W W E D C Q S S F T C K S N W H K G W N W S S G H N E C P V G A S C H P F T F Y F P T S A A L C E E I W S H S Y K L S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A E A M S G A G L H G T W P L L C S L S L V L L W V I S NP_000795 M A W Q M M Q L L L L A L V T A A G S A Q P R S A R A R T D L L SEQ ID NO: 33 N V C M N A K H H K T Q P S P E D E L Y G Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N F N W D H C G K M E P T C K R H F I Q D S C L Y E C S P N L G P W I R Q V N Q S W R K E R I L N V P L C K E D C E R W W E D C R T S Y T C K S N W H K G W N W T S G I N E C P A G A L C S T F E S Y F P T P A A L C E G L W S H S F K V S N Y S R G S G R C I Q M W F D S A Q G N P N E E V A K F Y A A A M N A G A P S R G I I D S NP_000794 M V W K W M P L L L L L V C V A T M C S A Q D R T D L L N V C M SEQ ID NO: 34 D A K H H K T K P G P E D K L H D Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N F N W D H C G K M E P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V N Q T W R K E R F L D V P L C K E D C Q R W W E D C H T S H T C K S N W H R G W D W T S G V N K C P A G A L C R T F E S Y F P T P A A L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S A Q G N P N E E V A R F Y A A A M H V N A G E M L H G T G G L L L S L A L M L Q L W L L G AAF66225 M A H L M A G Q W L L L L M W M A E C A Q S R A T R A R T E L L SEQ ID NO: 35 N V C M D A K H H K E K P G P E D K L H D Q C S P W K T N A C C S T N T S Q E D T K D I S Y L Y R F N W N H C G T M T P E C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R I L D V P L C K E D C V L W W E D C K S S F T C K S N W L K G W N W T S G H N E C P V G A S C H P F T F Y F P T P A V L C E K I W S H S Y K L S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A E V M S G A G L R E A W L L V C S L S L V L F C V V S BAA37125 M L R F A I T L F A V I T S S T C Q Q Y G C L E G D T H K A N P SEQ ID NO: 36 S P E P N M H E C T L Y S E P02752 M L R F A I T L F A V I T S S T C Q Q Y G C L E G D T H K A N P SEQ ID NO: 37 S P E P N M H E C T L Y S E S S C C Y A N F T E Q L A H S P I I K V S N S Y W N R C G Q L S K S C E D F T K K I E C F Y R C S P H A A R W I D P R Y T A A I Q S V P L C Q S F C D D W Y E A C K D D S I C A H N W L T D W E R D E S G E N H C K S K C V P Y S E M Y A N G T D M C Q S M W G E S F K V S E S S C L C L Q M N K K D M V A I K H L L S E S S E E S S S M S S S E E H A C Q K K L L K F E A L Q Q E E G E E R R Q05685 M A W K Q T P L L L L V Y M V T T G S G R D R T D L L N V C M D SEQ ID NO: 38 A K H H K T K P G P E D K L H D Q C S P W K K N A C C S V N T S Q E L H K A D S R L Y F N W D H C G K M E P A C K S H F I Q D S C L Y E C S P N L G P W I Q Q V D Q S W R K E R F L D V P L C K E D C H Q W W E A C R T S F T C K R D W H K G W D W S S G I N K C P N T A P C H T F E Y Y F P T P A S L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S T Q G N P N E D V V K F Y A S F M T S G T V P H A A V L L V P S L A P V L S L W L P G P35846 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A R T E L L SEQ ID NO: 39 N V C M D A K H H K E K P G P E D N L H D Q C S P W K T N S C C S T N T S Q E A H K D I S Y L Y R F N W N H C G T M T S E C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R I L D V P L C K E D C Q Q W W E D C Q S S F T C K S N W H K G W N W S S G H N E C P V G A S C H P F T F Y F P T S A A L C E E I W S H S Y K L S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A E A M S G A G L H G T W P L L C S L S L V L L W V I S P02702 A Q A P R T P R A R T D L L N V C M D A K H H K A E P G P E D S SEQ ID NO: 40 L H E Q C S P W R K N A C C S V N T S I E A X K D I S Y L Y R F N W D H C G K M E P A C K R H F I Q D T C L Y E C S P N L G P W I R E V N Q R W R K E R V L G V P L C K E D C Q S W W E D C R T S Y T C K S N W H K G W N W T S G Y N Q C P V K A A H C R F D F Y F P T P A A L C N E I W S H S Y K V S N Y S R G S G R C I Q M W F D P F Q G N P N E E V A R F Y A E N P T S G S T P Q G I AAD53001 M A H L M T M Q L L L L L I W V S E C A Q S R A T R A R T E L L SEQ ID NO: 41 N V C M D A K H H K E K P G P E D N L H N Q C S P W K K N S C C S T N T S Q E A H E D I S Y L Y R F N W D H C G K M T L E C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R I L D V P L C K E D C Q R W W E D C R T S F T C K S N W H K G W N W T S G Y N Q C P V G A S C R H F D F Y F P T P A A L C E E I W S H S Y K L S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A E A M S G A G L H G A W P L M C S L S L V L L W V F S R V P L T F AAD33741 M A L G R A R L L L L L V C V A V T W A A R P D L L N I C M D A SEQ ID NO: 42 K H H K T K P G P E D G L H E Q C S P W E M N A C C S V N T S Q E A H N D I S Y L Y K F N W E H C G K M K P A C K R H F I Q D T C L Y E C S P N L G P W I Q E V N Q K W R R E R I L N V P L C K E D C Q N W W E D C R T S Y T C K S N W H E G W N W S S G Y N R C P A N A A C H P F D F Y F P T P A A L C S Q I W S N S Y K Q S N Y S R G S G R C I Q M W F D P E Q G N P N E V V A R Y Y A Q I M S G A G L S E A W P L Q F G L A L T L L W L L S AAD33740 M A W R L T L F V L L G L V A A V G G A R A K S D M L N V C M D SEQ ID NO: 43 A K H H K P K P S P E D K L H D Q C S P W R K N S C C S V N T S L E A H K D I S Y L Y R F N W D H C G K M E P A C K R H F I Q D T C L Y E C S P N L G P W I Q E V N Q K W R R E R I L N V P L C K E D C Q I W W E D C R T S Y T C K S N W H K G W N W T S G Y N Q C P V S A A C H R F D F Y F P T P A A L C N E I W S H S F E V S S Y S R G S G R C I Q M W F D P A Q G N P N E A V A R Y Y A E G P L L T N L T E M V K H W V T G T E M V K H W V T G AAD19354 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A R T E L L SEQ ID NO: 44 N V C M D A K H H K AAD19353 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A R T E L L SEQ ID NO: 45 N V C M D A K H H K E K P G P E D N L H D Q C S P W K T N S C C S T N T S Q E A H K D I S Y L Y R F N W N H C G T M T S E C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R I L D V P L C K E D C Q Q W W E D C Q S S F T C K S N W H K G W N W S S G H N E C P V G A S C H P F T F Y F P T S A A L C E E I W S H S Y K L S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A E A M S G A G F H G T W P L L C S L S L V L L W V I S AAC98303 M A W Q M M Q L L L L A L V T A A G S A Q P R S A R A R T D L L SEQ ID NO: 46 N V C M N A K H H K T Q P S P E D E L Y G Q AAB81938 T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C SEQ ID NO: 47 K E D C E Q W W E D C R T S Y T C K S N W H K G C N W T S G F N K C A V G A A C Q P F H F Y F P T P I A R AAB81937 M L P A A T E V Q H R L Q G Q K D M V W K W M P L L L L L V C V SEQ ID NO: 48 A T M C S A Q D R T D L L N V C M D A K H H K T K P G P E D K L H D Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N F N W D H C G K M E P A C S A T S S R T P V S M S A H Q P G A L D P A G E S E L A A K N A S W M C P Y A K S T V S A G G R I V T P P T R A R A T G T E D G T G P Q E L T S A Q L G L S A A P L S P T S P L Q L P F V K A S G V T H T R S A T T A E G A A A A S R C G L L Q P R A T P T R K W R G S M L Q P C M AAB49703 M P W K L T A L L L F L A G V V S V C R A R A R T D L L N V C M SEQ ID NO: 49 D A K H H K V E P G P E D E L H D Q C V P W K K N A C C S A R V S H E L H R D K S S L Y N F S W E H C G R M E P A C K R H F I Q N N C L Y E C S P N L G P W F Q E V N Q K W R K E R F L N V P L C K E D C L D W W E D C R T S Y T C K S S W H K G W N W S S G S N Q C P T G T T C D T F E S F F P T P A A L C E G I W N H D Y K F T N Y S R G S G R C I Q M W F D A A E G N P N E E V A R F Y A L A L S A G T M S L G T G P L L L S A A L M L P L G L L D AAB35932 Q A T R A E T E N L N V D M D A K H H K E K SEQ ID NO: 50 1011184A A Q A P R T P R A R T D L L N V C M D A K H H K A E P G P E D S SEQ ID NO: 51 L H E Q C S P W R K N A C C S V N T S I E A X K D I S Y L Y R F N W D H C G K M E P A C K R H F I Q D T C L Y E C S P N L G P W I R E V N Q R W R K E R V L G V P L C K E D C Q S W W E D C R T S Y T C K S N W H K G W N W T S G Y N Q C P V K A A H C R F D F Y F P T P A A L C N E I W S H S Y K V S N Y S R G S G R C I Q M W F D P F Q G N P N E E V A R F Y A E N P T S G S T P Q G I 0908212A I A W A R T E L L N V X M N A K H H K E K P G P E D K L H E Q X SEQ ID NO: 52 X P W R K N A X X S T X T X Q E A X K D V S Y L Y R F N A P A C K R H F I Q D T C L Y E X S P N L G P X I Q Q V D Q S X R K E R V L N V W F D P A Q G N P N E Q V A CAA44610 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 53 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S CAA83553 M A W Q M M Q L L L L A L V T A A G S A Q P R S A R A R T D L L SEQ ID NO: 54 N V C M N A K H H K T Q P S P E D E L Y G Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N F N W D H C G K M E P T C K R H F I Q D S C L Y E C S P N L G P W I R Q V N Q S W R K E R I L N V P L C K E D C E R W W E D C R T S Y T C K S N W H K G W N W T S G I N E C P A G A L C S T F E S Y F P T P A A L C E G L W S H S F K V S N Y S R G S G R C I Q M W F D S A Q G N P N E E V A K F Y A A A M N A G A P S R G I I D S AAA74896 T R I A W A R T E L L N V C M N A K H H K E K P G P E D K L H E SEQ ID NO: 55 Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P S P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A AAA49056 M L R F A I T L F A V I T S S T C Q Q Y G C L E G D T H K A K P S P E P N M H E C T L Y S E S S C C Y A N F T E Q L A H S P I I K V S N S Y W N R C G Q L S K S C E D F T K K I E C F Y R C S P SEQ ID NO: 56 H A A R W I D P R Y T A A I Q S V P L C Q S F C D D W Y E A C K D D S I C A H N W L T D W E R D E S G E N H C K S K C V P Y S E M Y A N G T D M C Q S M W G E S F K V S E S S C L C L Q M N K K D M V A I K H L L S E S S E E S S S M S S S E E H A C Q K K L L K F E A L Q Q E E G E E R R AAA37599 M A W K Q T P L L L L V Y M V T T G S G R D R T D L L N V C M D SEQ ID NO: 57 A K H H K T K P G P E D K L H D Q C S P W K K N A C C S V N T S Q E L H K A D S R L Y F N W D H C G K M E P A C K S H F I Q D S C L Y E C S P N L G P W I Q Q V D Q S W R K E R F L D V P L C K E D C H Q W W E A C R T S F T C K R D W H K G W D W S S G I N K C P N T A P C H T F E Y Y F P T P A S L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S T Q G N P N E D V V K F Y A S F M T S G T V P H A A V L L V P S L A P V L S L W L P G AAA37598 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A R T E SEQ ID NO: 58 AAA37597 M F G L K F F L V L E A L L F L F T C Y I V L K I G L K I L SEQ ID NO: 59 AAA37594 M A W K Q T P L L L L V Y M V T T G S G R D R T D L L N V C M D SEQ ID NO: 60 A K H H K T K P G P E D K L H D Q AAA37596 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A RT E L L SEQ ID NO: 61 N V C M D A K H H K E K P G P E D N L H D Q AAA37595 M A H L M T V Q L L L L V M W M A E C A Q S R A T R A R T E L L SEQ ID NO: 62 N V C M D A K H H K E K P G P E D N L H D Q C S P W K T N S C C S T N T S Q E A H K D I S Y L Y R F N W N H C G T M T S E C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R I L D V P L C K E D C Q Q W W E D C Q S S F T C K S N W H K G W N W S S G H N E C P V G A S C H P F T F Y F P T S A A L C E E I W S H S Y K L S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A E A M S G A G L H G T W P L L C S L S L V L L W V I S AAA35824 T R I A W A R T E L L N V C M N A K H H K E SEQ ID NO: 63 AAA35823 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E SEQ ID NO: 64 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S AAA35822 M A Q R M T T Q L L L L L V W V A V V G E A Q T R I A W A R T E  SEQ ID NO: 65 L L N V C M N A K H H K E K P G P E D K L H E Q C R P W R K N A C C S T N T S Q E A H K D V S Y L Y R F N W N H C G E M A P A C K R H F I Q D T C L Y E C S P N L G P W I Q Q V D Q S W R K E R V L N V P L C K E D C E Q W W E D C R T S Y T C K S N W H K G W N W T S G F N K C A V G A A C Q P F H F Y F P T P T V L C N E I W T H S Y K V S N Y S R G S G R C I Q M W F D P A Q G N P N E E V A R F Y A A A M S G A G P W A A W P F L L S L A L M L L W L L S AAA35821 M V W K W M P L L L L L V C V A T M C S A Q D R T D L L N V C M SEQ ID NO: 66 D A K H H K T K P G P E D K L H D Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N E N W D H C G K M E P A C K R H E I Q D T C L Y E C S P N L G P W I Q Q V N Q T W R K E R F L D V P L C K E D C Q R W W E D C H T S H T C K S N W H R G W D W T S G V N K C P A G A L C R T F E S Y F P T P A A L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S A Q G N P N E E V A R F Y A A A M H V N A G E M L H G T G G L L L S L A L M L Q L W L L G AAA18382 M A W Q M M Q L L L L A L V T A A G S A Q P R S A R A R T D L L SEQ ID NO: 67 N V C M N A K H H K T Q P S P E D E L Y G Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N F N W D H C G K M E P T C K R H E I Q D S C L Y E C S P N L G P W I R Q V N Q S W R K E R I L N V P L C K E D C E R W W E D C R T S Y T C K S N W H K G W N W T S G I N E C P A G A L C S T F E S Y F P T P A A L C E G L W S H S E K V S N Y S R G S G R C I Q M W E D S A Q G N P N E E V A K F Y A A A M N A G A P S R G I I D S AAA17370 M V W K W M P L L L L L V C V A T M C S A Q D R T D L L N V C M SEQ ID NO: 68 D A K H H K T K P G P E D K L H D Q C S P W K K N A C C T A S T S Q E L H K D T S R L Y N E N W D H C G K M E P A C K R H E I Q D T C L Y E C S P N L G P W I Q Q V N Q T W R K E R F L D V P L   C K E D C Q R W W E D C L T S H T C K S N W H R G W D W T S G V N K C P A G A L C R T F E S Y F P T P A A L C E G L W S H S Y K V S N Y S R G S G R C I Q M W F D S A Q G N P N E E V A R F Y A A A M H V N A G E M L H G T G G L L L R L A L M L Q L W L L G  U02715 tctcattggg tcccattggc ctgaccctaa agcctgggtt cttttccacc agacctaatc SEQ ID NO: 69 tccatcgagc tggccttatc ctaagaacca cttggggtat ctataaaatc cagatgcccc ctggtgatga gcaattctct agattttgat gaaagttgaa tgtgtggatg ctggaatgag taaattaaca agtaaggaga tgaatgcaag caggaatgac taaatggaca gactcaggga gccttgaaga gggtggggtc tggaagggaa ggaagagagg aaggagaata gctaagtagg gagatttcac tcagtgctta ccagagcgcg ttgtctaccc tgtaccgaag acagaggctg tggggacagc ctaggggcct ggatctattg cctacttaga gagaggccaa ctcagacaca gccgtgtatg ctcccagcag caacggaggt tcaggcaaga tgcccgaaga gggaaggg BE518506 gggggctggg acaggcggta gctcgcctcg cggcggaccg ccagctcgat cccgagatcc SEQ ID NO: 70 aactacgagc tttttaactg cagcaacttt aagatacgct attggagctg gaattaccgc ggctgctggc accagacttg ccctccaatg gatcctcgtt aaaggattta aagtgtactc attccaatta cagggcctcg aaagagtcct gtattgttat ttttcgtcac tacctccccg agtcgggagt gggtaatttg cgcgcctgct gccttccttg gatgtggtag ccgtttctca ggctccctct ccggaatcga accctgattc cccgttaccc gtggtcacca tggtaggcac agaaagtacc atcgaaagtt gatagggcag acattcgaat gagacgtcac cgccacaaag ggcgcgcgat cggctcgagg ttatctagag tcaccaaagc ggccggggca accgagattg gcccgcatgg gttttgggtc tgataaatgc acgcatcccc ggaggtcagc gctcgtctgc atgtattagc tctagaattg ccacagttat ccaagtaacg ttggagcgat caaaggaacc ataactgatt taatgagcca ttcgcagttt cactgtaccg gccgtgtgta cttagacttg catggcttaa tctttgagac aagcatatgc tactggcagg a BG058247 gcggccgcct actactacta ctactgctcg aattcaagct tctaacgatg tacggggaca SEQ ID NO: 71 tgccgacggg cgctgacccc cttcgcgggg gggatgcgtg catttatcag atcaaaacca acccggtcag cccctctccg gccccggccg gggggcgggc gccggcggct ttggtgactc tagataacct cgggccgatc gcacgccccc cgtggcggcg acgacccatt cgaacgtctg ccctctccct taccaggacc acagctctgt tccttcggcc tctggtcctc tctggtcccc tcctgggttt cttacgtagt tgatttttcc tctttagtct cccccgacct gcgccc BG017460 tttttttttt tttcaaagta aacgcttcgg gcccccggga cactcagtca agagcatcgg SEQ ID NO: 72 ggaggcgccg agaggcaggg gctgggacag gcggtagctc gcctcgcggc ggaccgccag ctcgatccca agatccaact acgagctttt taact NM_000802 tcaagattaa acgacaagga cagacatggc tcagcggatg acaacacagc tgctgctcct SEQ ID NO: 73 tctagtgtgg gtggctgtag taggggaggc tcagacaagg attgcatggg ccaggactga gcttctcaat gtctgcatga acgccaagca ccacaaggaa aagccaggcc ccgaggacaa gttgcatgag cagtgtcgac cctggaggaa gaatgcctgc tgttctacca acaccagcca ggaagcccat aaggatgttt cctacctata tagattcaac tggaaccact gtggagagat ggcacctgcc tgcaaacggc atttcatcca ggacacctgc ctctacgagt gctcccccaa cttggggccc tggatccagc aggtggatca gagctggcgc aaagagcggg tactgaacgt gcccctgtgc aaagaggact gtgagcaatg gtgggaagat tgtcgcacct cctacacctg caagagcaac tggcacaagg gctggaactg gacttcaggg tttaacaagt gcgcagtggg agctgcctgc caacctttcc atttctactt ccccacaccc actgttctgt gcaatgaaat ctggactcac tcctacaagg tcagcaacta cagccgaggg agtggccgct gcatccagat gtggttcgac ccagcccagg gcaaccccaa tgaggaggtg gcgaggttct atgctgcagc catgagtggg gctgggccct gggcagcctg gcctttcctg cttagcctgg ccctaatgct gctgtggctg ctcagctgac ctccttttac cttctgatac ctggaaatcc ctgccctgtt cagccccaca gctcccaact atttggttcc tgctccatgg tcgggcctct gacagccact ttgaataaac cagacaccgc acatgtgtct tgagaattat ttgg U20391 taagttgaca cttctcaggt tgtcacaaga ttcaggtatg gctcactgtt gcaggacata SEQ ID NO: 74 agctgggatc tcctgggaat tggtctgctt gcaggcccta gagagccttc cttcttggtt gattttcctc tagagatcca actgtcttct caggctcccc tgcctgcctc ctccttgggt cctttcttgt ggcattgcca gattactggg cccccatttt ccctacactt actgccactc atagtctgat ggttcccaca tctgcatcca acctggactc ttcccctgag ctttcccctc tacaaccacc ttccccgggc caagggcaca caggcacctc gacaaaacag tgttctatgt ttcttcctgc ccaaacctgc ccctccctct cccttttccc atctgtggta ccaccatggg ctcagagaat aaaaaaaatg aaggcttctg tcattgactg gggtggagat ggagggaaga gttagcccag aatcacaggt gctgtagaaa ggatacctga gttgccggga gagggggtcc atgagttggg gatggaagga gagcttggcc cttcaaacaa ttgaagatct gatcaaaaga ttcagaacat ctgtgatttt gtggctggtg atgggtgaca cctgggctaa tggggttggg ggagttggtg gctctacaat ttatggcctt gggagatcct tgctctctat agctgactgg gaggttggaa gcctgggctc tagcccttgc cttgatcctc cggatctcat tttcctcatc tgcctaacag gacagagggg ttggaaactg atgagattag ctcaaaggat cctggcagct caggctgcaa gatttttttc agacctcagt gtttgggaaa aaattgggta ggtggagctt agggactggc cttaggcctg cactgttaat tcaccccctc ccactacccc atggaggcct ggctggtgct cacatacaat aattaactgc tgagtggcct tcgcccaatc ccaggctcca ctcctgggct ccattcccac tccctgcctg tctcctaggc cactaaacca cagctgtccc ctggaataag gcaaggggga gtgtagagca gagcagaagc ctgagccaga cggagagcca cctcctctcc caggtatgtg acactcccca tcccccttca gaggccacac accctatggc attcccacca tgtgttaagg attttctgaa ctggaagggc cctctgtttg cctgaaggcc agagaatctt gaagtggaga ctgaggccca gaccagagtg tggcctgctc aagattaaac gacaagttag tgttcatccc cctgaactag tacctgggct ctagcccttc agtccagagc tgagttctca gctcttctag tctggggccc caaggttggg tgtgggggtc atgattgttg gtggggaggg gtcacagctg gactaagacc tgaaggtgag actaggcagg tgggaaagga gcttgcagag tgatgctgct caaaaggaca ggaagagagc ctggcttcag aagcagccac agcaagagag actactgact gaacaggtgg gctccactgg gggctccgga aaggattttc tcagccccca tccccagcac tgtgtgttgg ccgcacccat gagagcctca gcactctgaa ggtgcagggg gcaaaggcca aaagagctct ggcctgaact tgggtggtcc ctactgtgtg acttggggca tggccctcat ctgtgctgaa atgattccac aaagattaaa ctggctatca tttgttgatt tcccccttct tacatttaat ccttgcagga gaaagctaag cctcaagata gtttgcttct ctttccccca aggccaagga gaaggtggag tgagggctgg ggtcgggaca ggttgaacgg gaaccctgtg ctctaaacag ttagggtttg ttcccgcagg aactgaaccc aaaggatcac ctggtattcc ctgagagtac agatttctcc ggcgtggccc tcaaggttag tgagtgagca ggtccacagg ggcatgattg gatcctggaa tgaatgaatc aaccatgaga gagtgaatga acactggaat caatagagta gcagagtaat ggattgtgga gcaggaaaga gagctgctgg gtgggaattc aattccaggc ttatatgagc cctgctgtgc agtcggcctg gagacagccc agctcaggcc ctgcctagac ccctgtcaag gaggccctgt caagaggaga ggaggggcag cacgggggca aggcaagctt gtgagcggga aaggcatgtc cactttagcg actggtatgt ggaagatgag ttagaggaga cagatggaga gaagtcatag gaaataaatt ctgagcattt taggagggcc cagacacctg gtgtccagtg gagtgaagga aacagtcgcc tcccaaaatt cagtgtctga ggtcaaagga ttgaagttct gtgatgacca aggagaagcc agctctgtgg tagggggcac aggagctccc caaggcccca gggctgtcca gctggctgtc ccctgccagc acccatgtcc tgtgacccca ccccaccaag atcccatggt ttccgggaag ggcctactaa actagcttga gtgatgaggc tagaaagggg ctgggaccaa ggtttaaaaa gcaaaacaaa ctaacaaaaa ccacactgca gcccccccaa ctaaaacatt tttataaact tttttttttt ttttgagatg gagtctcgct ctgtcaccca ggctagagtg caatggcaca atcttggctc actgtaacct ccacctcctg gattcaagtg attctcctgc ctcagcctcc cacgtagctg ggactacagg cacacgacac cgcacccagc tcattttgta tttttagtag agacagggtt tcactatgtt ggccaggctg gtctcaaact tctgacctca ggtgatccac ccacctcagc cttccaaagt gctgggatta caggcatgag ccaccgcgcc cagcccattt ttgtaaactt ttacaatgaa gtaatttggt gtcaaaatct gacctgaaaa ttaatgtgag tttatgtata gttttaattt atcccactag tgtaactgtt tcaccccaga atatacactt gattattggg tatatgaaaa aaatattttc tttgaatcac ctttgatgaa atcctaaaaa attttaaccc tgaaacattt gaataaggca ttgtggacct atggcaaact cctggctatt NM_016731 agggacagac atggctcagc ggatgacaac acagctgctg ctccttctag tgtgggtggc SEQ ID NO: 75 tgtagtaggg gaggctcaga caaggattgc atgggccagg actgagcttc tcaatgtctg catgaacgcc aagcaccaca aggaaaagcc aggccccgag gacaagttgc atgagcagtg tcgaccctgg aggaagaatg cctgctgttc taccaacacc agccaggaag cccataagga tgtttcctac ctatatagat tcaactggaa ccactgtgga gagatggcac ctgcctgcaa acggcatttc atccaggaca cctgcctcta cgagtgctcc cccaacttgg ggccctggat ccagcaggtg gatcagagct ggcgcaaaga gcgggtactg aacgtgcccc tgtgcaaaga ggactgtgag caatggtggg aagattgtcg cacctcctac acctgcaaga gcaactggca caagggctgg aactggactt cagggtttaa caagtgcgca gtgggagctg cctgccaacc tttccatttc tacttcccca cacccactgt tctgtgcaat gaaatctgga ctcactccta caaggtcagc aactacagcc gagggagtgg ccgctgcatc cagatgtggt tcgacccagc ccagggcaac cccaatgagg aggtggcgag gttctatgct gcagccatga gtggggctgg gccctgggca gcctggcctt tcctgcttag cctggcccta atgctgctgt ggctgctcag ctgacctcct tttaccttct gatacctgga aatccctgcc ctgttcagcc ccacagctcc caactatttg gttcctgctc catggtcggg cctctgacag ccactttgaa taaaccagac accgcacatg tgtcttgaga attatttgg NM_016730 ggaaaggatt ttctcagccc ccatccccag cactgtgtgt tggccgcacc catgagagcc SEQ ID NO: 76 tcagcactct gaaggtgcag ggggcaaagg ccaaaagagc tctggcctga acttgggtgg tccctactgt gtgacttggg gcatggccct catctgtgct gaaatgattc cacaaagatt aaactggcta tcatttgttg atttccccct tcttacattt aatccttgca ggagaaagct aagcctcaag atagtttgct tctctttccc ccaaggccaa ggagaaggtg gagtgagggc tggggtcggg acaggttgaa cgggaaccct gtgctctaaa cagttagggt ttgttcccgc aggaactgaa cccaaaggat cacctggtat tccctgagag tacagatttc tccggcgtgg ccctcaaggg acagacatgg ctcagcggat gacaacacag ctgctgctcc ttctagtgtg ggtggctgta gtaggggagg ctcagacaag gattgcatgg gccaggactg agcttctcaa tgtctgcatg aacgccaagc accacaagga aaagccaggc cccgaggaca agttgcatga gcagtgtcga ccctggagga agaatgcctg ctgttctacc aacaccagcc aggaagccca taaggatgtt tcctacctat atagattcaa ctggaaccac tgtggagaga tggcacctgc ctgcaaacgg catttcatcc aggacacctg cctctacgag tgctccccca acttggggcc ctggatccag caggtggatc agagctggcg caaagagcgg gtactgaacg tgcccctgtg caaagaggac tgtgagcaat ggtgggaaga ttgtcgcacc tcctacacct gcaagagcaa ctggcacaag ggctggaact ggacttcagg gtttaacaag tgcgcagtgg gagctgcctg ccaacctttc catttctact tccccacacc cactgttctg tgcaatgaaa tctggactca ctcctacaag gtcagcaact acagccgagg gagtggccgc tgcatccaga tgtggttcga cccagcccag ggcaacccca atgaggaggt ggcgaggttc tatgctgcag ccatgagtgg ggctgggccc tgggcagcct ggcctttcct gcttagcctg gccctaatgc tgctgtggct gctcagctga cctcctttta ccttctgata cctggaaatc cctgccctgt tcagccccac agctcccaac tatttggttc ctgctccatg gtcgggcctc tgacagccac tttgaataaa ccagacaccg c NM_016729 cattccttgg tgccactgac cacagctctt tcttcaggga cagacatggc tcagcggatg SEQ ID NO: 77 acaacacagc tgctgctcct tctagtgtgg gtggctgtag taggggaggc tcagacaagg attgcatggg ccaggactga gcttctcaat gtctgcatga acgccaagca ccacaaggaa aagccaggcc ccgaggacaa gttgcatgag cagtgtcgac cctggaggaa gaatgcctgc tgttctacca acaccagcca ggaagcccat aaggatgttt cctacctata tagattcaac tggaaccact gtggagagat ggcacctgcc tgcaaacggc atttcatcca ggacacctgc ctctacgagt gctcccccaa cttggggccc tggatccagc aggtggatca gagctggcgc aaagagcggg tactgaacgt gcccctgtgc aaagaggact gtgagcaatg gtgggaagat tgtcgcacct cctacacctg caagagcaac tggcacaagg gctggaactg gacttcaggg tttaacaagt gcgcagtggg agctgcctgc caacctttcc atttctactt ccccacaccc actgttctgt gcaatgaaat ctggactcac tcctacaagg tcagcaacta cagccgaggg agtggccgct gcatccagat gtggttcgac ccagcccagg gcaaccccaa tgaggaggtg gcgaggttct atgctgcagc catgagtggg gctgggccct gggcagcctg gcctttcctg cttagcctgg ccctaatgct gctgtggctg ctcagctgac ctccttttac cttctgatac ctggaaatcc ctgccctgtt cagccccaca gctcccaact atttggttcc tgctccatgg tcgggcctct gacagccact ttgaataaac cagacaccg NM_016725 tggaggcctg gctggtgctc acatacaata attaactgct gagtggcctt cgcccaatcc SEQ ID NO: 78 caggctccac tcctgggctc cattcccact ccctgcctgt ctcctaggcc actaaaccac agctgtcccc tggaataagg caagggggag tgtagagcag agcagaagcc tgagccagac ggagagccac ctcctctccc agggacagac atggctcagc ggatgacaac acagctgctg ctccttctag tgtgggtggc tgtagtaggg gaggctcaga caaggattgc atgggccagg actgagcttc tcaatgtctg catgaacgcc aagcaccaca aggaaaagcc aggccccgag gacaagttgc atgagcagtg tcgaccctgg aggaagaatg cctgctgttc taccaacacc agccaggaag cccataagga tgtttcctac ctatatagat tcaactggaa ccactgtgga gagatggcac ctgcctgcaa acggcatttc atccaggaca cctgcctcta cgagtgctcc cccaacttgg ggccctggat ccagcaggtg gatcagagct ggcgcaaaga gcgggtactg aacgtgcccc tgtgcaaaga ggactgtgag caatggtggg aagattgtcg cacctcctac acctgcaaga gcaactggca caagggctgg aactggactt cagggtttaa caagtgcgca gtgggagctg cctgccaacc tttccatttc tacttcccca cacccactgt tctgtgcaat gaaatctgga ctcactccta caaggtcagc aactacagcc gagggagtgg ccgctgcatc cagatgtggt tcgacccagc ccagggcaac cccaatgagg aggtggcgag gttctatgct gcagccatga gtggggctgg gccctgggca gcctggcctt tcctgcttag cctggcccta atgctgctgt ggctgctcag ctgacctcct tttaccttct gatacctgga aatccctgcc ctgttcagcc ccacagctcc caactatttg gttcctgctc catggtcggg cctctgacag ccactttgaa taaaccagac accg NM_016724 ggcaaggggg agtgtagagc agagcagaag cctgagccag acggagagcc acctcctctc SEQ ID NO: 79 ccaggaactg aacccaaagg atcacctggt attccctgag agtacagatt tctccggcgt ggccctcaag ggacagacat ggctcagcgg atgacaacac agctgctgct ccttctagtg tgggtggctg tagtagggga ggctcagaca aggattgcat gggccaggac tgagcttctc aatgtctgca tgaacgccaa gcaccacaag gaaaagccag gccccgagga caagttgcat gagcagtgtc gaccctggag gaagaatgcc tgctgttcta ccaacaccag ccaggaagcc cataaggatg tttcctacct atatagattc aactggaacc actgtggaga gatggcacct gcctgcaaac ggcatttcat ccaggacacc tgcctctacg agtgctcccc caacttgggg ccctggatcc agcaggtgga tcagagctgg cgcaaagagc gggtactgaa cgtgcccctg tgcaaagagg actgtgagca atggtgggaa gattgtcgca cctcctacac ctgcaagagc aactggcaca agggctggaa ctggacttca gggtttaaca agtgcgcagt gggagctgcc tgccaacctt tccatttcta cttccccaca cccactgttc tgtgcaatga aatctggact cactcctaca aggtcagcaa ctacagccga gggagtggcc gctgcatcca gatgtggttc gacccagccc agggcaaccc caatgaggag gtggcgaggt tctatgctgc agccatgagt ggggctgggc cctgggcagc ctggcctttc ctgcttagcc tggccctaat gctgctgtgg ctgctcagct gacctccttt taccttctga tacctggaaa tccctgccct gttcagcccc acagctccca actatttggt tcctgctcca tggtcgggcc tctgacagcc actttgaata aaccagacac cg NM_013307 atggcctcag ttccgaaaac caacaaaata gaaccgcggt cctattccat tattcctagc SEQ ID NO: 80 tgcagtatca ggcggctcgg gcctgctttg aacactccaa tttttcaaag taaacgcaac gggccccgcg gacactcagc ttacagcatc gaggggcgcc agaggcaagg ggcgggacgg gcggtggtcc ctcgcgcgga ccgcccgccc gctcccaaga tccaactacg agctttttac ctgcagcaac tttactatac gctattggag ctggaattac cgcggctgct ggcaccagac ttgccctcca atggctcctc gttaaaggat ttaaagtgga ctcattccaa ttacagggcc tcgaaagagt cctgtattgt tattttcgtc actacctccc cgggtcggga gtgggtaatt tgcgcgcctg ctgccttcct tggatgtggt agcctccagg ctccctctcc ggaatctgaa ccctcattcc ccgtcacccg tggtcaccat ggtcggcacg gcgactacca tcgaaagttg atagggcaga cgttcgaatg ggtcgtcgtc cgccgccacg gggggcgtgc gatcggcccg aggttatcta gagtcaccaa agccgccggc gcccgccccc cggccggggc cggagagggg ctgagggttg gttttgatct gataaatgca ccgatccccc ccgcgaaggg ggtcagcgcc cgtcggcatg tattagctct agaattacca cagttatcca agtag NM_008035 gctttagagg cagatcaggg tgtagttttc agctagcgcc gtgccttccc caccatgttc SEQ ID NO: 81 cttgccatga tgataatgta ctagacctct gaaactgtag cttctttgtt acagagtctc cgtgaatctg gaattcacca attcggcgag tctgaaagcc tcagtgatct ctcaggctcc atctgtctcc actccccagt ggaaggcttg cagctgtgtc accgctccag acttcacaca ggtgctggaa gactgaacta agacagaaag acatggcctg gaaacagaca ccactcttgc ttttggtcta catggtcaca acaggcagtg gccgggacag aacagaccta ctcaacgttt gcatggatgc caaacaccat aagacaaagc cgggccccga ggacaagctg catgaccagt gtagtccatg gaagaaaaat gcctgttgct cagtcaacac cagccaggag ctacacaagg ctgactcccg tctgtacttc aactgggatc actgtggcaa gatggagcct gcctgtaaga gtcacttcat ccaagactcc tgcctgtatg agtgctcccc caaccttggg ccttggatcc agcaagtgga ccagagttgg cgtaaagagc gtttcctgga tgtgccctta tgcaaagagg actgtcacca gtggtgggaa gcctgtcgta cctcctttac ctgcaagaga gactggcata aaggctggga ctggtcctca ggcattaaca agtgcccaaa cacagcaccc tgtcacacgt ttgagtacta cttcccgaca ccagccagcc tttgcgaggg tctctggagt cactcctaca aggtcagcaa ctacagcaga gggagtggcc gctgcatcca gatgtggttt gactcaaccc agggcaatcc caatgaggac gtggtgaagt tttatgcttc ctttatgaca tctgggactg tgccccatgc agcagtactt cttgtgccca gcctggcccc agtgctgtca ttatggctcc ctggctgaga ggtcagtctt cctctctaga tttctcctct atctaccctt ggtctggttc aactcttcaa agaataagga agtcttgagc ctgcttccac ccctctcctc tgtcatccag ttcctgatcc atgttggggg ttggggtttc tacaatcatt ttcaataaat ctatgacaca tctgggccta atgaaaaaaa aaa NM_008034 tggagctgag cacacacttg gaggttccac ttaccttagc tctgccttca gggtctgaca SEQ ID NO: 82 tggctcacct gatgactgtg cagttgttgc tcctggtgat gtggatggcc gaatgtgctc agtccagagc tactcgggcc aggactgaac ttctcaatgt ctgcatggat gccaagcacc acaaagaaaa accgggccct gaggacaatt tacacgacca gtgcagcccc tggaagacga attcctgctg ttccacgaac acaagccagg aagcacataa ggacatttcc tacctgtacc ggttcaactg gaaccactgc ggaactatga catcggaatg caaacggcac tttatccaag acacctgcct ctatgagtgt tccccgaact tgggaccctg gatccagcag gtggaccaga gctggcgcaa agagcggatc cttgatgttc ccctgtgcaa agaggactgt cagcagtggt gggaggactg ccagagctct tttacctgca agagcaattg gcacaaggga tggaactggt cctctgggca taacgagtgt cctgtgggag cctcctgcca tcccttcacc ttctacttcc ccacatctgc tgctctgtgt gaggaaatct ggagtcactc ctacaagctc agcaactaca gccgagggag cggccgctgc attcagatgt ggtttgaccc agcccagggc aaccccaacg aggaagtggc gaggttctat gccgaggcca tgagtggagc tgggcttcat gggacctggc cactcttgtg cagcctgtcc ttagtgctgc tctgggtgat cagctgagtt cctgttttac cttcagttgt ctggagcgcc accctgcttg gctcagcctc ccagctccca gcctcctttg tggtggggct ctgacagcct ctttaataaa ccagacattc cacatgtgcc ttatgaatta aaaaaaaaaa aaaaaaaaa BF153292 ctccgatccc gaaggccaac gtaataggac cgaaatccta taatgttatc ccatgctaat SEQ ID NO:83 gtatacagag cgtaggcttg ctttgagcac tctaatttct tcaaagtaac agcgccggag gcacgacccg gccaattaag gccaggagcg catcgccgac agaagggacg agacgaccgg tgcacaccta gggcggaccg gccggcccat cccaaagtcc aactacgagc tttttaactg caacaactta aatatacgct attggagctg gaattaccgc ggctgctggc accagacttg ccctccaatg gatcctcgtt aagggattta gattgtactc attccaatta ccagactcat agagcccggt attgttattt attgtcacta cctccccgtg tcaggattgg gtaatttgcg cgcctgctgc cttccttgga tgtggtagcc gtttctcagg ctccctctcc ggaatcgaac cctaattctc cgtcacccgt caccaccatg gtaggccact atcctaccat cgaaagttga tagggcagaa atttgaatga tgcgtcgccg gcacgatggc cgtgcgatcc gtcgagttat catgaatcat cgcagcaacg ggcagagccc gcgtcgacct tttatct BF114518 cgacgcatca ttcaaatttc tgccctatca actttcgatg gtaggatagt ggcctaccat SEQ ID NO: 84 ggtggtgacg ggtgacggag aattagggtt cgattccgga gagggagcct gagaaacggc taccacatcc aaggaaggca gtaggcgcgc aaattaccca atcctgacac ggggaggtag tgacaataaa taacaatacc gggctcttcg agtctggtaa ttggaatgag tacaatctaa atcccttaac gaggatccat tggagggcaa gtctggtgcc BE940806 gcacggccct cgtgccggcg acgcatcatt caaatttctg ccctatcaac tttcgatggt  SEQ ID NO: 85 aggatagtgg cctgatacgg tggtgacggg tgacggagaa ttagggttcg attccggaga gggagcctga gaaacggcta ccacatccaa ggaaggcagc aggcgcgcaa attacccaat cctgatacgg ggaggtagtg acaataaata acaataccgg gctctcagag tctggtaatt ggatgagtac aatctaaccc ttaacgagga tccattggag ggcaagtctg gtgcacgagc BE858996 gaagacacac gtttagtatt ttattatgaa tcattatttc aaagtcccat actgcatatt SEQ ID NO: 86 catataaggc aacacggcac aatttcaggc ttcatcacaa aggatgaaaa agactgtttc taactccctc ctaatttgca gacatgcttg aacacttaat ggaaggtgaa gtttattttg tggcccctca gttctctttc aagtcctcta gtagaaagtc tccatggtgt gatcttctga ctgggtagaa cccgcaattc tctgctgttt ttagtctttg ttccagatga ctaattacat gacttggctg catttgtgag gggccgacac caacacaatt aaatcagtgc accattcagg gccatagggt aggaggcacc agtggtcacc atggtaggca cggcgactac catcgaaagt tgatagggca gacgttcgaa tgggtcgtcg ccg AF219906 gttgaagagt cacctggtgc ttcaacggga ctgatttcct gggcctggag ttggagatca SEQ ID NO: 87 gaggtctgac AF219905 cgctgatctg gaagcataaa caagaactga agctgaaggc tctaggggtt cccaacctgt SEQ ID NO: 88 gatctccagc agacactcct ggtgtgtcac cggattcagg ctcctgggat aaagaaagca aaggaagtct ggagtggaga cgaagaaacc ccaggcactc tgagagctgc taccttttcc atgtgtgctg ccagacactt ctcgtcaggg accaaatacc ccaagggagt ggagagaggc ctgggctggg ccagacttcc tgggctttaa cctgtgctcc aagtaggtgg gtcacatttt ccccagcggg agttgaagag tcacctggtg cttcaacggg actgatttcc tgggcctgga gttggagatc agaggtctga c AF219904 ggggctggag ttggagatca gaggtctgac atggctcacc tgatggctgg gcagtggttg SEQ ID NO: 89 ctcctgctga tgtggatggc cgaatgtgcc cagtccagag ctactcgggc caggaccgaa cttctcaatg tctgcatgga tgccaagcac cacaaagaaa agccaggccc agaggacaag ttacacgacc agtgcagccc ctggaagacg aatgcctgct gctccaccaa cacaagccag gaagacacta aggacatttc ctacctgtac cgattcaact ggaatcactg tggaactatg accccggagt gcaaacgtca ctttatccaa gacacctgcc tctatgagtg ttccccgaac ttgggaccct ggatccagca ggtggaccag agctggcgca aagagcggat ccttgatgtt cccctgtgca aagaagactg tgtgctgtgg tgggaggact gcaagagctc ttttacctgc aagagcaact ggctcaaggg atggaactgg acctcggggc ataatgagtg ccctgtggga gcctcctgcc atcccttcac tttctacttc cctacacctg ctgtgctgtg tgagaaaatc tggagtcact cctacaagct cagcaactac agccgaggga gcggccgctg catccagatg tggttcgacc cagcccaagg caaccccaac gaggaagtgg cgaggttcta tgccgaggtc atgagtggag ctgggcttcg cgaggcctgg ctgctggtgt gcagcctgtc cttagtgctg ttctgcgtcg tcagctgagt tcctgttact ccttgtctgg agctccaccc tgcccggctt agcctcccag ctccagcctc ctttgtggtg gggctctgac agcctgttta gtaaaccaga cattctaaaa aaaaaa BE687177 acccggtgag ctccctcccg gctccggccg ggggtcgggc gccggcggct ttggtgactc SEQ ID NO: 90 tagataacct cgggccgatc gcacgccccc aggtcaagtt tgtttatgaa ggtattttgg tattgttttc ctttgcttaa ttgcctcaca ttttgttctg aaaaacatgg gtccactgtt aaaaccgaat gtatgtgtag ctttattctg tttcacaggc gcatgtgatt ggaaaactca ttgtctcctc cagcctcagg agacttctaa aaagttttgc gtagctcaag ttgtgcatga attaccgaat atattatttt tcagcttttc ttcatgaacg atatttgaca tgtgctttgg tacccttctc tgaaagttga aaacctacct acttagtccc ttctgtgcct tttttatttt gccaaccatg ttttatggaa aagacattag caattacatt ttgcaaatgg aattatgt BE636622 ggcaccagag tagtcatatg cttgtgttaa agattaagcc atgcatgcct aagtacaaac SEQ ID NO: 91 tattcttatg gtaaaactgc ggacggctcc atagatcagt aatagttcgt tcagtgattt gaaaaagtac ttggataacc ctgttaattg tagagctaat acatgcaccg acggcctgat cgggtgaccg agagggtcgc acttgtctta attcacagtg ccccggaact gaggctgttc gacgtggtag gggaggacgc tgaatggggc tggtagaaac aactgggggt ataaaaccaa ggaggaagca aaaaagccat aacccggcga tggccttggt ggaaacctct gggctcaagg ttgttattat gttcattgtg gcctctcggg gttattttga atgtggtaat aaaccgaaag caactctatc agtttggttt ggatgtccgt taatcctgcg tggccagcgg ctttggggac tccaggggac agggcgaaac gaggcaattc aaagctgatc gctttctaac gagggcgaca cactgttcga attcctgacn tatcaactcg atggtaggat agtggcctac catggttata acgggtaacg gggaatcagg gctcgattcc ggagagggag cctgagaaac ggctaccact tccaaggaag gcagcaggcg cgtaaattac tccctgccga cacggcgagg tagtgacgac aaataccaag gaaaaccgcc tttggtggtt ttccattgga atgagcagaa ttcaaacccc tctgcaagta acaattggag ggcaagtctg gtgccagcag c BE627230 tccctcgact gtagttgctg agcttgtagg agtgactcca gatttcctca cacagagcag SEQ ID NO: 92 cagatgtggg gaagtagaag gtgaagggat ggcaggaggc tcccacagga tccctcgact gtagttgctg agcttgtagg agtgactcca gatttcctca cacagagcag cagatgtggg gaagtagaag gtgaagggat ggcaggaggc tcccacagga BE506561 ctatcgatat ccgatggtac ttgttgtgcc taccatggtg accccagttc atagcgaatg SEQ ID NO: 93 agggtgcgat ggcagagagg gaggatgtga tgcagctatc gcatgcggtg gatgctggag gcgcgcatgt tgcaccctcc cgacggcgag aggtggtgac tacccatatc gtgcaggact ctttcgacgc gctgtagtct gaatgagtac actttaagtc cgtgagcgcg gatctatcgg ttggcgagtt tagtgccagc agcgcgaggc tttacagcct caatgtcgtg tatgacagtt gcgtgtcctt atggagcgtg agttggatca tggg BE505048 gcggccgcct actactacta aattcgcggc cgcgtcgacc gacgacccat tcgntcgtct SEQ ID NO: 94 gccctatcaa ctttcgntgg ttgtcgccgt gcctaccatg gtgaccacgg gtgacgggga ttctgggttc gtttccggtg agggtgcctg tggggcggtt gcctcttctc tggttggctg caggcgcgct tttttcctcc tcccggcccg gggtggttgt BE496754 ctggctgcag gaattcgcac gaggctatat gctcagttta aagattaagc catgcatgtc SEQ ID NO: 95 gagttcatct ttgaagagaa actgcgaacg gctcattaga gcagatgtca tttattcgga acgtcctttt ggataactgc ggtaattctg gagctaatac atgcaaataa accctgactt ttgaaagggt gcaattatta gagcaaatca atcactttcg ggtgcagttt gctgactctg aataacgcag catatcggcg gcttgttcgc cgatattccg aaaaagtgtc tgccctatca acctgatggt agtctattag tctaccatgg ttattacggg taacggagaa taagggttcg actccggaga gggagcctta gaaacggcta ccacatccaa ggaaggcagc aggcgcgaaa cttatccaat cttgaacaga tgagatagtg actaaaaata aaaagaccat tcctatggaa cggtcatttc aatgagttga tcataaacct tttttcgagg atcaagtgga gggcaagtct ggtgccagca gccgcggtaa ttccagctcc actagtgtaa atcgtcattg ctgcggttaa aaagctcgta gttggatctg agttacatgc BB114010 atcatccaga tttcgtttga tttcaccccg ggccttccgg aggaggacct cctgaaattt SEQ ID NO: 96 tctccttcct atatgacatt agggactgtg ccccaagcag cagtactttt tgtccccagc ctgcccccag tgccgtcatt atggctcccc gctgagaggt cagttttcct ctctagattt ttcctctatt tacccttggt ctggttcaac ttttcaaaga ataaggaagt cttgaccctg cttccacccc tttcctctgt catccagttc ctgatccatg tggggggttg gggtttctac aatcattttc aataaattta tgacacatct gggcctaatg BB109527 aggacgtttg atgtcttatg cttcctttat gaaatccggg attgtgcccc atccagcagt SEQ ID NO: 97 attcttgtgc ccagcctggc cccactgcag tcattatgcc tccctggctg agaggtcatt cttcctcttt agatttctcc tcaatctacc cttgttctgg ttcaactctt caaagaataa ggaagtcttg accctgcttc cacccctttc ctctttcatc cagttcctga tccatgttgg gggttggggt ttctacattc attttcaata aatctatgac acac BB107219 tccgggcctt tccccccaca caccaaaaac ttttctgcct actctggccc cagcgctttc SEQ ID NO: 98 cttatgcctc cctggctgag aggtcatttt cttctataga tttctcctct atttaccctc gctctggttc aactcttcaa agaataagga acttttgagc ctgcttccac ccttttcctc tgtcatccag ttcctgatcc atgttggggg ttggggtttc tacaatcatt ttcaataaat ctatgacaca tctgggccta atg BE206324 tttttgtgcg gtgtctggtt tattcaaagt ggctgtcaga ggcccgacca tggagcagga SEQ ID NO: 99 accaaatagt tgggagctgt ggggctgaac agggcaggga tttccaggta tcagaaggta aaaggaggtc agctgagcag ccacagcagc attagggcca ggctaagcag gaaaggccag gctgcccagg gcccagcccc actcatggct gcagcataga acctcgccac ctcctcattg gggttgccct gggctgggtc gaaccacatc tggatgcagc ggccactccc tcggctgtag ttgctgacct tgtaggagtg agtccagatt tcattgcaca gaacagtggg tgtggggaag tagaaatgga aaggttggca ggcagctccc actgcgcact tgttaaaccc tgaagtccag tttcagccct tgtgccagtt gctcttgcag gtgtaggagg tgcgacaatc ttcccaccat tgctcacag BE448392 ggatggaact ggtcctcggg gcataacgag tgtcctgtgg gagcctcctg ccatcccttc SEQ ID NO: 100 accttctact tccccacatc tgctgctctg tgtgaggaaa tc BE207596 tagtgtgggt ggctgtagta ggggaggctc agacaaggat tgcatgggcc aggactgagc SEQ ID NO: 101 ttctcaatgt ctgcatgaac gccaagcacc acaaggaaaa gccaggcccc gaggacaagt tgcttgtagc agtgtcgacc ctggaggaag aatgcctggt gttctaccaa caccagccag gaagcccata aggatgtttc ctacctatat agattcaact ggaaccactg tggagagatg gcacctgcct gcaaacggca tttcatccag gacacctgcc tctacgagtg ctcccccaac ttggggccct ggatccagca ggtggatcag agctggcgca aagagcgggt actgaacgtg cccctgtgca aagaggactg tgagcaatgg tgggaagatt gtcgcacctc ctacacctgc aagagcaact ggcacaaggg ctggaactgg acttcagggt ttaacaagtg cgcagtggga gctgcctgcc aacctttcca tttctacttc cccacaccca ctgttctgtg caatgaaatc tggactcact cctacaggtc agcaactaca gccgagggag tgg BE206635 tgcggtgtct ggtttattca aagtggctgt cagaggcccg accatggagc aggaaccaaa SEQ ID NO: 102 tagttgggag ctgtggggct gaacagggca ttttatttcc aggtatcata ttgtttgttg tnggagctga ncagccacag cagcattagg gccaggctaa gcaggaaagg ccaggctgcc cagggcccag ccccactcat ggctgcagca tagaacctcg ccacctcctc attggggttg ccctgggctg ggtcgaacca catctggatg cagcggccac tccctcggct gtagttgctg accttgtagg agtgagtcca gatttcattg cacagaacag tgggtgtggg gaagtagaaa tggaaaggtt ggcaggcagc tcccactgcg cacttgttaa accctgaaga ccagttccag cccttgtgcc agttgctctt ggaggtgtag gaggtgccac aatcttccca ccattgctca cagtccttct tgcacagggg cacgttcaga accc BE240998 cgacgcatca ttcaaatttc tgccctatca actttcgatg gtaggatagt ggcctaccat SEQ ID NO: 103 ggtggtgacg ggtgacggag aattagggtt cgattccgga gagggagcct gagaaacggc taccacatcc aaggaaggca gtaggcgcgc aaattaccca atcctgacac ggggaggtag tgacaataaa taacaatacc gggctcttcg agtctggtaa ttggaatgag tacaatctaa atcccttaac gaggatccat tggagggcaa gtctggtgcc agcagccgcg gtaattccag ctccaatagc gtatatttaa g BE228221 tatatgctca gtttaaagat taagccatgc atgtcgagtt catctttgaa gagaaactgc SEQ ID NO: 104 gaacggctca ttagagcaga tgtcatttat tcggaacgtc cttttggata actgcggtca ttctggagct aatacatgca aataaaccct gacttttgaa agggtgcaat tattagagca aatcaatcac tttcgggtgc agtttgctga ctctgaataa cgcagcatat cggcggcttg ttcgccgata ttccgaaaaa gtgtctgccc tatcaacctg atggtagtct attagtctac catggttatt acgggtaacg gagaataagg gttcgactcc ggagagggag ccttagaaac ggctaccaca tccaaggaag gcagcaggcg cgaaacttat ccaatcttga acagatgaga tagtgactaa aaataaaaag accattccta tggaacggtc atttcaatga gttgatcata aacctttttt ccagttaatt ctac BE225416 tggccgggat tagaacaaaa ccacgcggct tcggctgctt cttgttgact cagaataact SEQ ID NO: 105 aagctgaccg catggccttg tgccggcggc gtgtctttca agcgtccact ttatcaactt gacgggagca taatcgactc ccgtggtggt gacggataac ggaggatcag ggttcgactc cggagaaggg gcctgagaaa tggccactac gtctaaggat ggcagcaggc gcgcaaatta cccactctcg gctcgaggag gtagtgacga gaaataacga gatcgttctc tttgaggccg gtcatcggaa tgagtacaat ttaaaccctt taacgagtat caagcagagg gcaagtctgg tgccagcagc cgcggtaatt ccagctctgc taatacatag aattattgct gcggttaaaa agctcgtagt tggattcgta tcggtaccct ggaaccctcc gggtgtttct gggtgttatc gatttatcgt aatgttcggt tttgagtcct taacaggatt cttaacaggc attgcaagtt tactttgaac aaatcagagt gcttcaaaca ggcgtttgcg ctgaatgatc gtgcatggat BE225404 tgcctaatgt gccaccgctg agtgtgatga tattgacaat cggtagcatt atggccgggt SEQ ID NO: 106 gtgtctattt caaagattaa gccatgcatg tataagttta aatcgttttg acgagaaacc gcgaacggct cattacaatg gccatgattt acttgatctt gattatctaa atggattaac tgtggaaaag ctagagctaa tacatgcacc aaaacttgtt cctctcggaa aagcgcattt attagaacaa aaccacgcgg cttcggctgc ttcttgtgac tcagaataac taagctgacc gcatggcctt gtgccggcgg cgtgtctttc aagcgtccac tttatcaact tgacgggagc ataatcgact cccgtggtgg tgacggataa cggaggatca gggttcgact ccggagaagg ggcctgagaa atggccacta cgtctaagga tggcagcagg cgcgcaaatt acccactctc ggctcgagga ggtagtgacg agaaataacg agatcgttct ctttgaggcc ggtcatcgga atgggtacaa tttaaaccct ttaacgagta tcaagcagag ggcaagtctg gtgccagcag ccgggtattc cagctctgct aatacataga atta BB214040 tccccaccct gcccccagtg ctgtcattat ggatccctgn ctgagaggtc aatcttcctt SEQ ID NO: 107 tctagatttt tcctctatct acccttggtc tggttcaaat tttcaaagaa taaggaagtc ttgagcctgc ttccacccct ctcctctttc atccagttcc taatccatgt tgggggttgg ggtttctaca atcattttca ataaatttat gacacatctg gg BE199619 gtagttgctg agcttgtagg agtgactcca gatttcctca cacagagcag cagatgtggg SEQ ID NO: 108 gaagtagaag gtgaagggat ggcaggaggc tcccacagga cactcgttat gccccgagga ccagttccat cccttgtgcc aattgctctt gcaggtaaaa gagctctggc agtcctccca ccactgctga cagtcctctt tgcacagggg aacatcaagg atccgctctt tgcgccagct ctggtccacc tgctggatcc agggtcccaa gttcggggaa cactcataga ggcaggtgtc ttggataaag t BE199597 acacagagca gcagatgtgg ggaagtagaa ggtgaaggga tggcaggagg ctcccacagg SEQ ID NO: 109 acactcgtta tgccccgagg accagttcca tcccttgtgc caattgctct tgcaggtaaa agagctctgg cagtcctccc accactgctg acagtcctct ttgcacaggg gaacatcaag gatccgctct ttgcgccagc tctggtccac ctgctggatc cagggtccca agttcgggga acactcatag aggcaggtgt cttggataaa gt BE198610 actgcggaac tatgacatcg gaatgcaaac ggcactttat ccaagacacc tgcctctatg SEQ ID NO: 110 agtgttcccc gaacttggga ccctggatcc agcaggtgga ccagagctgg cgcaaagagc ggatccttga tgttcccctg tgcaaagagg actgtcagca gtggtgggag gactgccaga gctcttttac ctgcangagc aattggcaca agggatggaa ctggtcctcg gggcataacg agtgtcctgt gggagcctcc tgccatccct tcaccttcta cttccccaca tctgctgctc tgtgtgagga aatct BE198571 actgcggaac tatgacatcg gaatgcaaac ggcactttat ccaagacacc tgcctctatg SEQ ID NO: 111 agtgttcccc gaacttggga ccctggatcc agcaggtgga ccagagctgg cgcaaagagc ggatccttga tgttcccctg tgcaaagagg actgtcagca gtggtgggag gactgccaga gctcttttac ctgcaagagc aattggcaca agggatggaa ctggtcctcg gggcataacg agtgtcctgt gggagcctcc tgccatccct tcaccttcta cttccccaca tctgctgctc tgtgtgagga aatct BE188055 gaggccagta gtcatatgct tgtctcaaag attaagccat gcatgtctaa gtataagcaa SEQ ID NO: 112 ctatacggtg aaactgcgaa tggctcatta aatcagttat cgtttatttg atagtacctt actacatgga taaccgtggt aattctagag ctaatacatg ctaaaaaccc cgacttcgga aggggtgtat ttattanata aaaaaccaac gcccctcggg gctccttggt gaatcataat aacttcacga atcgcatggc cttgcgccgg cgatggttca ttcaaatttc tgccctatca actttcgatg gtaggataga ggcctaccat ggtttcaacg ggtaacgggg aattagggtt cgactccgga gagggagcct gagaaacggc taccacatcc aaggaaggca gcaggcgcgc aaattaccca atcccgaccg gggagggagn gacaataaat actgatncng gctntttggg gtcttgnaat tggaatgagt ncaattaaat cccttaccag gaacaattgg aggcaanttg gngcccccan cncggnattc cactccatag cgttntaaag tttgcaatta aaagttgaat taacttggcc tggtggcggc ccctacgggt ctggccggcg gcnttntttg gggccgnncc tttatgnggg gggaacngct ttntt BE187804 tgacaattga atacggatgc ccccgactat ccctattaat cattacgggg gtcctagaaa SEQ ID NO: 113 ccaacaaaat anaaccacnc gtcctattct attattccat gctaatgtat tcgagcaaag gcctgctttg aacactntaa ttttttcaaa gnaaaagtcc tggttccccg acncncccag ngaagggcat gcggctcccc aaaaggaaag gcccggccgg accagtacac gcggngaggn ggaccggcca gccaggccca aggttcaact acgagctttt taactgcaac aactttaata tacgctattg gagctggaat taccgnggnt gctggcacca aacttgccct ccaattgttc ctcgttaagg ggatttaaat tgtactcatt ccaattacaa gacccaaaag agccctgtat cagtatttat tgncactact BB032646 tgtgccccat gcaacagtaa tttttgagcc caccctggcc ccagtgctgt cattatggct SEQ ID NO: 114 ccctggctga gaggtcagtt ttcctatcta gatttttcct gtatctaccc ttggtctggt tcaaattttc aaagaataag gaagtcttga gcctgcttcc acccctttcc tctgtcatcc agttcctgat ccatgttggg ggttggggtt tctacaatca ttttcaataa atctatgaca catctg BE037278 ccccctagat gctagtagca gtngncacga ggtcatatgc ttgtctcaaa gattaagcca SEQ ID NO: 115  tgcatgtgta agtatgaact aattcagact gtgaaactgc gaatggctca ttaaatcagt tatagtttgt ttgatggtac ctgctactcg gataaccgta gtaattctag agctaatacg tgcaacaaac cccgacttct ggaagggatg catttattag ataaaaggtc gacgcgggct ttgcccgttg ctctgatgat tcatgataac tcgacggatc gcacggnctt tgcgccggcg acgcatcatt caaatttctg ccctatcaac tttcgatggt aggatagtgg cctaccatgg tggtgacggg tgacggagaa ttagggttcg attccggaga gggagcctga gaaacggcta ccacatccaa ggaaggcagc aggcgcgcaa attacccaat cctgcacggg gaggtaggga caataaataa caataccggg ctcttcgagt ctggtaattg gaatgagtac aatctaaatc ccttaacgag gatacattgg agggccaagt ctgttgccag cagccgcggt atattccagc ttcaatagnc gtatatttaa agttgttggc agttaaaaag cttgtatttg gactctgggg tgggcgaccc ggtcgtctag cggtgtgcac cggc BE037125 gactactcat cagtgncagg ctagctgcac gaggtcatat gctcgtctca tagattaagc SEQ ID NO: 116  catgcatgtg taagtatgaa ctaattcaga ctgtgaaact gcgaatggct cattaaatca gttatagttt gtttgatggt acctgctact aggataaccg tagtaattct agagctaata cgtgcaacaa accccgactt ctggaaggga tgcatttatt agataaaagg tcgacgcggg ctttgcccgt tgctctgatg attcatgata actcgacgga tcgcacggcc tttgcgccgg cgacgcatca ttcaaatttc tgccctatca actttcgatg gtaggatagt ggcctaccat ggtggtgacg ggtgacggag aattagggtt cgattccgga gagggagcct gagaaacggc taccacatcc aaggaaggca tcaggcgcgc atattaccca atcctgacac ggcgaggtag tgacaataaa taacaatacc gggctcttcg agtctcggta atcggaatga gttcaatcta tatcccttta cgaggatcca ttggagggca agtcctgctg ccagcagcct gctgtccttt cagctccaat agcgtatatt taagttgttg cagtttaaca agctcttatt cgaccttgtc gtgcgaccgt tctcattacg ctatatgcct catcatatgt ccatatctat tctcgacttc tcgctcccct cgtcttctct agtacttctg cctcttctat tatattcact atgatctatt ctctacgcct cttcctctgc actcttatat tcatcgcact cttcactcta ctctctctta tcgtctgcta gtctttcgct tcttcctctt tctactttct catgtctctc atcttatctt accctctctc actctttctg ttcgtctcct ctcactctgc gatttctcca ctgtatcacg cttcgttctc tctactcttc tacttgttct ctctctatct cgtcctcatc tcctccgtct cgtctctatc gtcgtctacc gatactcttt ccttctctgt catcttcctc tctcttcctc tcttgcttac ttctcgtctc tcttcacgat tatcntctag cacgtcatct ctttactctc tctatcttca tgtctactca ctctctcctg tgcgtactac tcttggctat catcatctcc tagagtggct cgatgaggcg aatgtgcncn tctatctctc tacgttctct tactgatact tctttg BE037110 gtcgacgcac tagtgctata gtagcgttca tgcnagcngc acgaggagag agagagagag SEQ ID NO: 117 agagagagag agagagagag agagagagag agagagagag agagagagag agagagagag agagagagag agagagcggc acgagcttgt ctcaaagatt aagccatgca tgtgtaagta tgaactaatt cagactgtga aactgcgaat ggctcattaa atcagttata gtttgtttga tggtacctgc tactaggata accgtagtaa ttctagagct aatacgtgca acaaaccccg acttctggaa gggatgcatt tattagataa aaggtcgacg cgggctttgc ccgttgctct gatgattcat gataactcga cggatcgcac ggcctttgcg ccggcgacgc atcattcaaa tttctgccct atcaactttc gatggtagga tagtggccta ccatggtggt gacgggtgac agagaattag ggttcgattc cggagaggga gcctgagaaa cggctaccac atccaaggaa ggcagcatgc gcgcaaatta cccaatcctg acacggagag gtagtgacaa tatataacaa taccgcgctc ttcgagtctg gtaattggaa tgagtacaat ctatatccct taacgaggat ccattgtagg gcatgtctgg tgccagcagt cgcggtaatt tcagttccaa ttagcgatat ttaattcgtt gcagtaaaaa gctcgtattt gaactttgcg tgggcccacc taccgtctag cggtgtgcac tgtcttctct gcttttttcg gcatagcctc tgccttaaag cttgtctcgc actgctctta cttcgatatt tgatcttcat gcgctctctt ggatctcatc atggatacct aatgatctgc ctttctttgc ttggattcgc atcatcattg tacctggtct ttcgttctan ttagtatttc tcgattttat catcctgcta ccctactcga tttattttaa actatttgtc ttaacctatt tctttctctt cttacttcac tcttcctcgt aatctgtctt attatcactc ttcctcattt ctttattact gttcatttac ttatttactt tatttccttc tacatctttt ctctcatctt ctactcacgt cg BE037009 cccacactag ttctagagga ttcggcacga ggtctcaaag attaagccat gcatgtgtaa SEQ ID NO: 118 gtatgaacta attcagactg tgaaactgcg aatggctcat taaatcagtt atagtttgtt tgatggtacc tgctactagg ataaccgtag taattctaga gctaatacgt gcaacaaacc   ccgacttctg gaagggatgc atttattaga taaaaggtcg acgcgggctt tgcccgttgc tctgatgatt catgataact cgacggatcg cacggccttt gcgccggcga cgcatcattc aaatttctgc cctatcaact ttcgatggta ggatagtggc ctaccatggt ggtgacgggt gacggagaat tagggttcga ttccggagag ggagcctgag aaacggctac cacatccaag gaaggcagca ggcgcgcaaa ttacccaatc ctgacacggn gaggtagtga acaataataa caataccggg ctcttcgagt ctggtaatgg gaatgagtac aatctaaatt ccttaac BE036024 gcacgagcga cgcgggcttt gcccgttgct ctgatgattc atgataactc gacggatcgc SEQ ID NO: 119 acggcctttg cgccggcgac gcatcattca aatttctgcc ctatcaactt tcgatggtag gatagtggcc taccatggtg gtgacgggtg acggagaatt agggttcgat tccggagagg   gagcctgaga aacggctacc acatccaagg aaggcagcag gcgcgctaat tacccaatcc tgacacgggg aggtagtgac aataaataac aataccgggc tcttcgagtc tggtaattgg aatgagtaca atctaaatcc cttaacgagg atccattgga gggcaagtct ggtgccagca gccgcggtaa ttccagctcc aatagcgtat atttaagttg ttgcagttaa aaagctcgta gttggacctt ggggtgggcc gaccggtccg cctagcggtg tgcaccggtc gtcctgcctc ttctgccggc gatgcgctcc tggccttaac tgggccggtc gtgccaccgg gcgctgtact ttgaagaaat agagtgctca agcaggccta cgctctggat acattagcat gggataacat cataggaatt ccgtcctatt ctgttgccct tcggnattcg agtaattgat aacaggnnac agcgggggca ttcgtatttc atagtcagag gtgaaaatct tggattattg aagaccaaca actgccaaag catttggcca ggatgttttc attattcaag accgaaagtt ggggcttcga agaccaacag attcccgtct aatcttaaac cttaaacata tcccaccagg ggatcgggga tgtaactttt aggaccccgc cggcccctta tgagaaatta aagttttggg gtcccggggg gagtttggtg ccaaggcttt aacttaaggg aattgcgcgg aggggccccc cccgggaatg ggccctgt BE035828 gcacgaggtc tcaaagatta agccatgcat gtgtaagtat gaactaattc agactgtgaa SEQ ID NO: 120 actgcgaatg gctcattaaa tcagttatag tttgtttgat ggtacctgct actaggataa ccgtagtaat tctagagcta atacgtgcaa caaaccccga cttctggaag ggatgcattt attagataaa aggtcgacgc gggctttgcc cgttgctctg atgattcatg ataactcgac ggatcgcacg gcctttgcgc cggcgacgca tcattcaaat ttctgcccta tcaactttcg atggtaggat agtggcctac catggtggtg acgggtgacg gagaattagg gttcgattcc ggagagggag cctgagaaac ggctaccaca tccaaggaag gcagcaggcg cgcaaattac ccaatcctga cacggggagg tagtgacaat aaataacaat accgggctct tcgagtctgg taattggaat gagtacaatc taaatccctt aacgaggatc cattggaggg caagtctggt gccagcagcc gcggtaattc cagctccaat agcgtatatt taagttgttg cagttaaaaa gctcgtagtt ggaccttggg gtgggccgac cggtccgcct agcggtgtgc accggtcggn cttgcctctt ttgtcggcga tgcgctcctg gcctttaact ggccgggttg tgccaccggc gctgttactt ttgaagaaat aagagtgctc aaagcaagcc ctacgctctg gttacattag catgggataa caatatagga tttccggtcc tattttgttg gcctttggga tcggagttat gaataacagg gaccgtccgg gggcatttct tttttaatat tcaaaggtga aat BE035751 agctggtacg cctgcggtac cggtccggaa ttcccgggtc gacccacgcg tccgcggacg SEQ ID NO: 121 cgtgggcgga cgcgtggggc taatacatgc aactcggtct ctaccggaaa tggtagggac gcttttatta gaccaaaacc aatcgggcgt tctcgtccgt tttgccttgg tgactctgaa taaattgtgt gcagatcgca cggtcctcgt accggcgacg catctttcaa atgtctgcct tatcaacttt cgatggtagg tcctgcgcct accatggttg taacgggtaa cggggaatca gggttcgatt ccggagaggg agcctgagaa acggctgcta catccaagga aggcagcagg cgcgcaaatt acccactccc ggcacgggga ggtagtgacg acaaataacg atacgggact catccgaggc cccgtaatcg gaatgaacac actttaaatc ctttaatgag tatccattgg agggcaagtc tggtgccagc agccgcggta attccagctc caatagcgta tattaaagtt gttgcggtta aaaagctcgt agtcggactt gtgtcacacg ctgccggttc accgcccgtc ggtgctaact ggcatgcacg tgttgacgtc ctgctggtgg ccgtagccgg tccgggtgtt ctgggatccc ttcggngttt cccggacccc ggtgcttggt gaaggcctac ttgacctacc cgtcgcggtg ctcttaaccg agtgtctcga tgggccggca cttttacttt gaacaattag agtgcttaaa gcaggcagta tcagccctga tactgagtgc atggaataat ggaataggaa cctcggtcta tttntgtggt tttcggaatg ccctagatcg cgagcggccg ctctagaaga tccaagctta cgtacgcctg cattgccaag tataagcttt tttatatggg gaaccctaaa ttcaatcaac tggcgcgcgg tttaacacac gcggag BE019724 tgctgctcct tctagtgtgg gtggctgtag taggggaggc tcagacaagg attgcatggg SEQ ID NO: 122 ccaggactga gcttctcaat gtctgcatga acgccaagca ccacanagga aaattctttc cccgaggaca agttgcatgt tctgtggggg ccctggagga agaatgcctg ctgttctacc aacaccagcc aggaagccca taaggatgtt tcctacctat atagattcaa ctggaaccac tgtggagaga tggcacctgc ctgcaaacgg catttcatcc aggacacctg cctctacgag tgctccccca acttggggcc ctggatccag caggtggatc agagctggcg caaagagcgg gtactgaacg tgcccctgtg caaagaggac tgtgagcaat ggtgggaaga ttgtcgcacc tcctacacct gcaagagcaa ctggcacaag ggcctggaac ctggacttca gggttttaac aaggtgcgca ggtgggaggc tgccctgccc accttttcca ttttctactt ctctcacacc cactgttgct gttgcattgc aaatcttgtc ctcacttctt acaaggtaca gcaactacca agaaaaa AW913291 aggacatttc ctacctgtac cggttcaact ggaaccactg cggaactatg acatcggaat SEQ ID NO: 123 gcaaacggca ctttatccaa gacacctgcc tctatgagtg ttccccgaac ttgggaccct ggatccagca ggtggaccag agctggcgca aagagcggat ccttgatgtt cccctgtgca aagaggactg tcagcagtgg tgggaggact gccagagctc ttttacctgc aagagcaatt ggcacaaggg atggaactgg tcctcggggc ataacgagtg tcctgtggga gcctcctgcc atcccttcac cttctacttc cccacatctg ctgctctgtg tgaggaaatc t AW912445 gcggccgctc cctcgactgt agttgctgag cttgtaggag tgactccaga tttcctcaca SEQ ID NO: 124 cagagcagca gatgtgggga agtagaaggt gaagggatgg caggaggctc ccacaggaca ctcgttatgc cccgaggacc agttccatcc cttgtgccaa ttgctcttgc aggtaaaaga gctctggcag tcctcccacc actgctgaca gtcctctttg cacaggggaa catcaaggat ccgctctttg cgccagctct ggtccacctg ctggatccag ggtcccaagt tcggggaaca ctcatagagg caggtgtctt ggataaagtg ccgtttgcat tccgatgtca tagttccgca gtggt AW823912 gggctgtgga cgaagactgt agagactacc cagagtctga cctagggaga ggccaactcg SEQ ID NO: 125 gataccccta tgtgcgctcc cagaagctaa ggacattgag acagaaagac atggcctgga aacagacacc actcttgctt ttggtctaca tggtcacaac aggcagtggc cgggacagaa cagacctact caacgtttgc atggatgcca aacaccataa gacaaagccg ggccccgagg acaagctgca tgaccagtgt agtccatgga agaaaaatgc ctgttgctca gtcaacacca gccaggagct acacaaggct gactcccgtc tgtacttcaa ctgggatcac tgtggcaaga tggagcctgc ctgtaagagt cacttcatcc aagactcctg cctgtatgag tgctccccca accttgggcc ttggatccag caagtggacc agagttggcg taaagagcgt gtcctggatg tgcccttatg caaagaggac tgtcaccagt ggtgggaagc ctgtcgtacc tnctttacct gcaagagaga ctggcataaa ggctgggact ggtcctcagg catttacaag tgcccaaaca cagcaccctg tcacacgttt gagtactact tcccgacacc agccagccct tgc AW823418 tttttttttt ttcccaaatg tgtcatagat ttattgaaaa tgattgtaga aaccccaacc SEQ ID NO: 126 cccaacatgg atcaggaact ggatgacaga ggaaaggggt ggaagcaggc tcaagacttc cttattcttt gaagagttga accaaaccaa gggtagatag aggagaaatc tagagaggaa gactgacctc tcagccaggg agccataatg acagcactgg ggccaggctg ggcacaaaaa gtactgctgc atggggcaca gtcccagatg tcataaagga agcataaaac ttcaccacgt cctcattcgg attgccctgg gttgagtcaa accacatttg gatgcagcgg ccactccctc tgctgtagtt gctgaccttg taggagtgac tccagagacc ctcgcaaagg ctggctggtg tcgggaagta gtactcaaac gtgtgacagg gtgctgtgtt tgggcacttg ttaatgcctg aggaccagtc ccagccttta tgccagtctc tcttgcaggt aaaggaggta cgacaggctt cccaccactg gtgacagtcc tttttgcata agggcaccat ccagaaaacg ctctttacgc caactcttgt tccacttgct gatccaaagg ccaaagttgg gggagcact AB023803 cagcctcttg cacacagctt tactctgtca gccccagggt ggaaacaaag ggctggctgt SEQ ID NO: 127 tcatcacact gcactttgtg taatcactcg ctctcacaac tggcaaatct cttttgccag tggtgggact gaataacatt ttaaagggat gaagtacagc acagagctgt acaagatagt ggatgactgc agactttttc ataattttgt accatttcta aaaaagtgat gtttctcaaa ttactacaag ttgattttaa ctccattctt tttaaaatgt gattgatgtg tgtttctcat tttacacaca gatgtatgca aatgggaccg acatgtgcca gagtatgtgg ggggaatcct ttaaggtgag cgaatcctcc tgcctctgct tgcaaatgaa caagaaggac atggtggcaa tcaagcacct cctctccgaa agctcagagg aaagctccag tatgagcagc agtgaggagc acgcctgcca aaagaaactc ctgaagtttg aggcactgca gcaagaggaa ggggaagaga gaagatgaat tttggtggat gaatatcagg aggagaggaa tcattgtgga ggttgtgctc ggggcatcac agcagcctgt cttatccctc acttctgaga acacaataaa tcaatggttg gctatatt AB022344 acaagcagat taatttcatt agcacgcatc accatatata ataaagctgt aataggccaa SEQ ID NO: 128  atgctccaat ttacacttgt gaaactccgt ctcactccag ccacactgtt gttacacttt catgatgcca aggagggaaa cagatctggc agctgtcaca agttggaagt acaaacaatt tttcccttca ccactacagc tttgcagagt taacaaaaat ataaaaccag aaaagcttac ttcagtcatt agagagatct gcctcactaa aaagggatca ctgtgttgag ttaggagatg tcagtttgac atagatacta actcaatggc cagaagctgt gaagttagca actagctgga gttcttgtat ctcttttgca tttttttccc tcattaccca atggtagctc ttgcagaagg aattcatgca ggcaggtagc ggctcctgag agctcaaata gctgcgtctg tgatttcgga ataaatacat ccttctgcta acatcgctgg ccattatcag atagtcagat gataatgtaa taataataat gtacccgtgc cagaattact gtctgtggca atatctgtaa catcatgcat gctttaacgc tgtataaaaa ctttgagaag atgaatataa gttcataggg caatgatatt aatgttaaaa ataaatgata acaggagttt tatcagtaca aaaatatgag cgagtacttg caaataaatt cagcattaac aaatgaggtt aacaacccat tcaagtattg aaagcaataa gaaacattct ttaataaatt tctcaatata agacttacgg tcttatactg agacttttct tactcagaat aagaaaaaga agactcaaga tgatgaagat gtgtggctga aatctctaga agctcctgtg cttgagcctg cctacatcta ttgttaacca aagccaagtc tgagaaatca caaacatatg acaattttcc ttcctgctgt tagaaattct gcctaatctc ccagcaagtg gtcccatttg gctcatattc aaagcttgaa aaagatccca gtctcctata gcttaatata atttgtatgt caattccata aacaaaggca ttacatgaaa cctcctggct cctaacacct ttacaagagt gaatacattt catacaaacc aagcagtaag gaacagaaca cgtgcttttt caccaggctg gctagcacag ccactcatcc tcagattgaa aggggatgtt tatgtggcac agtggtcttt actttgtatg aatacactga tcttagtacc aagcaatatg cacaagtcct ttacactaca aatcagcaag aagctccatt aatttcagcc agcacaaaat caagccacat gaagtgaagg cacaggcaat aaggtcttac atttacttca gtttctccta tacttatatt atgtctcttt gtatttgttt taattaaatt cactctggaa agcagaaaac actagggttt caaatgatct gaaaatggtc ttgtaaaggc agcagcactt ttgcctcaag gaaggcttca gccagagcag gaattggtgc ttacagctca gcagagatca ttatcatact gtgagtttgc tcagtgagat tcattccaca cttccactgt gccagtgttt gttttattca agcaaaaaag ttttgtaaat actgacccac agttactatt tgacaaacca ctgttgtgtt ttaaaataga aacaagagat gctattttcc atttgcatct gaataattgc aaagtagtca gtggcgtgtt gctagttagg gagctcactg ggatttgacc tatggaagta agtgcaccta tttgtaatga ccacgtctgc tttctgtgat ggtccatgtt cagatgtgga atcccctctg cagaaagcac acctggtaag gaaatccagt cagcaactgc tgtcagtggt actcgcaaca gtttctccta gtgtttgtga cacccttgga aagcacaaac atggcaggta gagaaagaag gacaaacatc agcaggttaa aaaaagaatc ttctgggcaa agagcaaagg cctgagaatc aggagaccaa ttctccttca ggggctgcag taaaattact gagtaaccca aagcaaactg atatgtttac ggtcaaaatt aagccagaca ggttgaaata tgaagagttg tttgaaatgt ctataattca gtgaagttgg tgataagaac ccaattaagc tgttgtagaa atgaatctaa taattataac aaaaggaatc attgcaaaat caggcagggg gtgggaggta gttgtattgg gtacactgga gagctgttgt ttttctaatt ctagtctatg tttgtacttt cctgtttatt atgtccacat ttgcaagcaa taaaagggca ttatgtgctg gtcattccat ctgcttttga gataaatcta tgttagcatt tcaaagggtc aaggaactct ccagggcaaa caaattctgg agcgctgctg ccagatggcg cgtatataaa gtggaaagcg agaaaagcaa tttgctgtgt ttctgttcca gggagaagtc tcacccagaa ggacagcaaa agaggtgaga aactaccgag aaattgtaca ggcggttttc ttctgtaaca tgttgctttc tttgcatctg aaaagtttag gtacggagag aagctcagtt cttgttcagg caaagctctt ccaaaaaggt atcaggaata tttaaccaaa gaattgaagg ttaagttaat aacacctata aagaattatg cacttcttta tgtgggaggt tctagattta tctgtataac tcactaatat gtagtctgta cttacagaaa ctctatgctc gcagaccaaa tggtggttat cttgcatatt tgactgaact ctacaaaagc agacacaaaa ccattgatca gattattagg ttcaaataag cgtgacctca acaaaggcaa gttatctgca taatttatcc agctcaattg ccaccttatg ctctgctatt agcttgtcaa ttctgtaaac agaagcactg caattaaatg ggtaatttcc cagcacacaa aagaactctg taagtttcgg agctgatcaa tcttgccttc aaatctagtg tagcagtggg atgggaaatc catatctgca tgagaaattt aaaaaccttt tgttaaatac tgaaaaccat aacatatagc cttcattctt catatagcct gtattcttca taggtcacca gaaactgaaa atatgtagca gaagcattaa gtgtttggac atgagcaaag gaaagggaga atgagtgacc caatatttat atgcgtacct ctcttgagca tatttaattg tatatatatg tagctttttt acagcagccc ttctttttac 3420 tatcaggact tttcctacaa ataaaggata tcagtaaaga cttctctccg cacaggaaaa 3480 AW475385 tttccccagt cagctggctg atctggaagt ataaacaaga aaggaggctg acggctctag SEQ ID NO: 129 aagtccccaa cctgttgtga tcttcagtag acaaacactc ctggtgtgtc acaggattca ggccactaaa cctcggccgg ctgtctcctg gaatgaagaa agcaaaggaa gcctagagtg gagacaaaga aacccgaggc actctgagag ctgccatctt atccttgttt gccgcctgac acttctcagc aggatccaca taccctaagg agtggaagac tccttggcgc ttggtgcttc aaccggactg acttcctggg cctggagttg gcgattagag gtctgacatg gctcacctga tgactgtgca gttgttgctc ctggtgatgt ggatggccga atgtgctcag tccagagcta ctcgggccag gactgaactt ctcaatgtct gcatggatgc catacaccac agagaaaaac cgggccctga tgacaattta cacgaccagt gcagcctctg gaaacgaatt cctgctgttc cacgaacact agccatgaag cacataagga catgtcctac ctgttccaga tcaactggaa ccactgcggg actatgacat cggaatgcag actgcactgt atgcaagaca cctgcctcta tgagtgtaca cagaacttgg gacgctggat tcatctagtg aaccaaagct ggc AW323586 cacctgatga ctgtgcagtt gttgctcctg gtgatgtgga tggccgaatg tgctcagtcc SEQ ID NO: 130 agagctactc gggccaggac tgaacttctc aatgtctgca tggatgccaa acaccacaaa gaaaaaccgg gccctgagga caatttacac gaccagtgca gcccctggaa gacgaattcc tgctgttcca cgaacacaag ccaggaagca cataaggaca tttcctacct gtaccggttc aactggaacc actgcggaac tatgacatcg gaatgcaaac ggcactttat ccaagacacc tgcctctatg agtgttcccc gaacttggga ccctggatcc agcaggtgga ccagagctgg cgcaaagagc ggatccttga tgttcccctg tgcaaagagg actgtcagca gtggtgggag gactgccaga gctcttttac ctgc AW319308 caacaaccca ttcaaacatc taccctatca actttcaataa tagtcacca tacctaccat SEQ ID NO: 131 aataaccacg aataacaaaa aatcataatt caattccaaat aagaatcct aagaaactac taccacatcc aaataataca gcatacactc aaattacccac tcccgaccc aagaaaattt aacgaaaaat aacaatacaa tactctttcg aagccctataa ttaaaataa atccacttta aatcctttaa cgaagatcca ttngagaaca attctgctgat atcac AW239668 aagattatgc ctcccccnaa ttcggcacga ngcggggagcg agcggnccc cctccctgtc SEQ ID NO: 132 cgtctcctgg tcggggtcct tttttaataa cgcgtaaacct atccaangg tacacaacga agaagcttgg acaaaaggcg gaaaagcgtc ttgccaaaagg gggactgga ngtnaactgg aaaaaaacta attttccaag agaagaactt ggnagaanggg gaattgngt ttcnggggtg nccttctcgn tctccggggn cgnanttctg natncgcaaca agcaaggac caatccaatc ccgggnacgc gggcggnccc anccgcgaag nttttcanncc cganaatcc aaacaatcct ggccnaagaa atatgccctt gngtaacaaa ccntcccaatt tttttaata tatcccaaan tnttattatt aaaacaaatg ctnaaanccc tccactcccnn anggttaaa naaatggggt ccnnttggca ccaactttaa tgggangttt gggnttanaaa naaacaccc cttccntttt cccggggngc gttatttggg gnngcacccc ccccgcncttt aattttgtt AV253136 atgggaatat cccccataca atagtacttc ttgtgcccaat ctggcccca gtgccgtcat SEQ ID NO: 133 tatgggtccc tgcgtgagag gtcattttct tctttagattt ttcctctat ttacccttgg tctggttcaa ttcttcaaag aataaggaag ttttgagcctg cttccaccc ctttcttctt tcatccagtt cctgatccat gttgggggtt ggggtttctac aatcatttt caataaatct atgacac AW013716 gaattcggca cgagccagta gcatatgctt gtctcaaagn ttaagccatg caagtctaag SEQ ID NO: 134 tacacacggc cggtacagtg aaactgcgaa tggctcatta aatcagttat ggttcctttg atcgctctca cgttacttgg caattccaga gctaatacat gctaatacat gccaacgggc gctgacctcc ggggacgcgt gcatttatca gacccaaaac ccatgcgggg tgctcctcac ggggtgcccc ggccgctttg gtgactctag ataacctcga gctgatcgct ggccctcgtg gcggcgacgt ctcattcgaa tgtctgccct atcaactttc gatggtactt tttgtgccta ccatggtgac cacgggtaac ggggaatcag ggttcgattc cggagaggga gcctgagaaa cggctaccac atccaaggaa ggcagcaggc gcgcaaatta cccactcccg actcggggag gtagtgacga aaaataacaa tacaggactc tttcgaggcc ctgtaattgg aatgagtaca ctttaaatcc tttaacgaag atccattgga gggcaagtct ggtgccagca gccgcnggta attcagctcc aatagcgtat cttaaagttg ctgcaattaa aaagctccgt attggacctc ggatc AW013704 gaattcggca cgagcagtag catatgcttg tctcaaagat taagccatgc aagtctaagt SEQ ID NO: 135 acacacggcc ggtacagtga aactgcgaat ggctcattaa atcagttatg gttcctttga tcgctctcac gttacttgga taactgtggc aattccagag ctaatacatg ccaacgggcg ctgacctccg gggacgcgtg catttatcag acccaaaacc catgcggggt gctcctcacg gggtgccccg gccgctttgg tgactctaga taacctcgag ctgatcgctg gccctcgtgg cggcgacgtc tcattcgaat gtctgcccta tcaactttcg atggtacttt ttctgcctac catggtgacc acgggtaacg gggaatcagg gttcgattcc ggagagggag cctgagaaac ggctaccaca tccaaggaag gcagcaggcg cgcaaattac ccactcccga ctcggggagg tagtgacgaa aaataacaat acaggactct ttcgaggccc tgtaattgga atgagtacac tttaaatcct ttaacgagga tccattggaa ggcaagtctg gtgccagcag ccgcggtaat tccagctcca atagcgtatc ttaaagttgc tgcagttcaa caagcctcgt attggacctc ggattc AW013702 gaattcggca cgaggcggta ttcaggcgac cgggcctgct ttgaacactc taattttttc SEQ ID NO: 136 aaagtaaacg cttcggaccc cgcgggacac tcagctaaga gcatcgaggg ggcgccgaga ggcaggggct gggacagacg gtagctcgcc tcgcggcgga ccgtcagctc gatcccgagg tccaactacg agctttttaa ctgcagcaac tttaagatac gctattggag ctggaattac cgcggctgct ggcaccagac ttgccctcca atggatcctc gttaaaggat ttaaagtgta ctcattccaa ttacagggcc tcgaaagagt cctgtattgt tatttttcgt cactacctcc ccgagtcggg agtgggtaat ttgcgcgcct gctgccttcc ttggatgtgg tagccgtttc tcaggctccc tctccggaat cgaaccctga ttccccgtta cocgtggtca ccatggtagg cacaaaaagt accatcgaaa gttgatangg cagacattcg aatgagacgt cccgccacga aggccagcga tcagctcgag gttatctana gtcaccacag cggccggggc cacccgttga ggaaccaccg ccgcattggg ggttttgggt ctgaataaat tgcac AW013696 gaattcggca cgaggctnga cctccgggga cgcgtgcatt tatcagaccc aaaacccatg SEQ ID NO: 137 cggggtgctc ctcacggggt gccccggccg ctttggtgac tctagataac ctcgagctga tcgctggccc tcgtggcggc gacgtctcat tcgaatgtct gccctatcaa ctttcgatgg tactttttgt gcctaccatg gtgaccacgg gtaacgggga atcagggttc gattccggag agggagcctg agaaacggct accacatcca aggaaggcag caggcgcgca aattacccac tcccgactcg gggaggtagt gacgaaaaat aacaatacag gactctttcg aggccctgta attggaatga gtacacttta aatcctttaa cnaggatcca ttggagggca agtctggtgc catcagccgc ggtaattcca gctccaatan cgtatcttaa agttggctgc acttaaaaag ctcntanttg gacctcggga tccnagctga cggtccgccg ctaagcgaac ttaccgtctg tc gggtgccccg gccgctttgg tgactctaga taacctcgag ctgatcgctg gccctcgtgg cggcgacgtc tcattcgaat gtctgcccta tcaactttcg atggtacttt ttgtgcctac catggtgacc acgggtaacg gggaatcagg gttcgattcc ggagagggag cctgagaaac ggctaccaca tccaaggaag gcagcaggcg cgcaaattac ccactcccga ctcggggagg tagtgacgaa aaataacaat acaggactct ttcgaggccc tgtaattgga atgagtacac tttaaatcct ttaacgagga tccattggag ggcaagtctg gtgccagcag ccgccggtaa ttccagctcc atagcgtatc ttaaanttgc ctgccagtta aataagcctc AW013647 gaattcggca cgagtggccg tccctcttaa tcatggcccc agttcagaga agaaaaccca SEQ ID NO: 139 caaaatagaa ccggagtcct attccattat tcctagctgc ggtattcagg cgaccgggcc tgctttgaac actctaattt tttcaaagta aacgcttcgg accccgcggg acactcagct aagagcatcg agggggcgcc gagaggcagg ggctgggaca gacggtagct cgcctcgcgg cggaccgtca gctcgatccc gaggtccaac tacgagcttt ttaactgcag caactttaag atacgctatt ggagctggaa ttaccgcggc tgctggcacc agacttgccc tccaatggat cctcgttaaa ggatttaaag tgtactcatt ccaattacag ggcctcgaaa gagtcctgta ttgttatttt tcgtcactac ctccccgagt cgggagtggg taatttgcgc gcctgctgcc ttccttggat gtggtagccg tttctcaggc tccctctccg gaatcgaacc ctgattcccc gttacccgtg gtcaccatgg taggcacaaa ccgtaccatc gaaagttgat agggcagaca ttccgaatga gacgtcgccg ccaccgaggg ccagcggatc tagctcgagg ttatctagag tcaccaaaag ccggccgggg caccccgtga ggaacacccc gccattggg AW013501 gaattcggca cgaggtcagc tcgatcccga ggtccaactg cgagcttttt aactgcagca SEQ ID NO:140 actttaagat acgctattgg agctggaatt accgcggctg ctggcaccag acttgccctc caatggatcc tcgttaaagg atttaaagtg tactcattcc aattacaggg cctcgaaaga gtcctgtatt gttatttttc gtcactacct ccccgcgtcg ggagtgggta atttgcgcgc ctgctgcctt ccttggatgt ggtagccgtt tctcaggctc cctctccgga atcgaaccct gattccccgt tacccgtggt caccatggta ggcacaaaaa gtaccatcga aagttgatag ggcagacatt cgaatgagac gtcgccgcca cgagggccag cgatcagctc gaggttatct agagtcacca aagcggccgg ggcaccccgt gaggagcacc ccgcatgggt tttgggtctg ataaatgcac gcgtccccgg aggtcagcgc ccgttggcat gtattagctc tggaattgcc acagttatcc aagtaacgtg agagcgatca aaggaaccat aactgattta atgagccatt cgcagtttca ctgtaccggc cgtgtgtatt agacttgcat ggcttaatct ttgagacaag catatctcgt gccgaattc AW013484 gaattcggca cgaggcccta tcaactttcg atggtacttt ttgtgcctac catggtgacc SEQ ID NO: 141 acgggtaacg gggaatcagg gttcgattcc ggagagggag cctgagaaac ggctaccaca tccaaggaag gcagcaggcg cgcaaattac ccactcccga ctcggggagg tagtgacgaa aaataacaat acaggactct ttcgaggccc tgtaatgga atgagttacac tttaaatcct ttaacgagga tccattggag ggcaagtctg gtgccagcag ccgcggtaat tccagctcca atagcgtatc ttaaagttgc tgcagttaaa aagctcgtag ttggacctcg ggatcgagct gacggtccgc cgcgaggcga gctaccgtct gtcccagccc ctgcctctcg gcgccccctc gatgctctta gctgagtgtc ccgcggggtc cgaaacgttt actttgaaaa aattagagtg ttcaaagcag gcccggtcgc ctgaataccg catctaggaa taatggccta ggactccggt tctattttgt gggttttctt ctctgaactg gggccatgat taagaaggac ggccgggctc gtgccgaatt c  AW013428 gaattcggca cgaggtgccc ttccgtcaat tcctttaagt ttcagctttg caaccatact SEQ ID NO: 142 ccccccggaa cccaaagact ttggtttccc ggacgctgcc cggcgggtca tgggaataac gccgccggat cgctagttgg catcgtttac ggtcggaact acgacggtat ctgatcgtct tcgaacctcc gactttcgtt cttgattaat gaaaacattc ttggcaaatg ctttcgcttt cgtccgtctt gcgccggtcc aagaatttca cctctagcgg cacaatacga atgcccccgg ccgtccctct taatcatggc cccagttcag agaanaaaac ccacaaaata gaaccggagt cctattccat tattcctagc tgcggtattc aggcgaccgg gcctgctttg aacactctaa ttttttcaaa gtaaacgctt cggaccccgc gggacactca gcctcgtgcc gaattc AW013404 gaattcggca cgaggctgcg gtattcaggc gaccgggcct gctttgaaca ctctaatttt SEQ ID NO: 143 ttcaaagtaa acgcttcgga ccccgcggga cactcagcta agagcatcga gggggcggaa ttcggcacga gctgggacag acggtagctc gcctcgcggc ggaccgtcag ctcgatcccg aggtccaact acgagctttt taactgcagc aactttaaga tacgctattg gagctggaat taccgcggct gctggcacca gacttgccct ccaatggatc ctcgttaaag gatttaaagt gtactcattc caattacagg gcctcgaaag agtcctgtat tgttattttt cgtcactacc tccccgagtc gggagtgggt aatttgcgcg cctgctgcct tccttggatg tggtagccgt ttctcaggct ccctctccgg aatcgaaccc tgattccccg ttacccgtgg tcaccatggt aggcacaaaa agtaccatcg aaagttgata gggcagacat tcgaangaga cgtcgccgcc acgagggcca gcgatcagct cgaggttatc tagagtcacc aaagcggccg gggcaccccg tgaggagcac cccgcatggg ttttgggtct gataaatgca cgcgctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctccctcg tgccgaattc AW013386 gaattcggca cgaggctgcg gtattcaggc gaccgggcct gctttgaaca ctctaatttt SEQ ID NO: 144 ttcaaagtaa acgcttcgga ccccgcggga cactcagcta agagcatcga gggggcgccg agaggcaggg gctgggacag acggtagctc gcctcgcggc ggaccgtcag ctcgatcccg aggtccaact acgagctttt taactgcagc aactttaaga tacgctattg gagctggaat taccgcggct gctggcacca gacttgccct ccaatggatc ctcgttaaag gatttaaagt gtactcattc caattacagg gcctcgaaag agtc AW013284 gaattcggca cgagtacttg gataactgtg gcaattccag agctaataca tgccaacggg SEQ ID NO: 145 cgctgacctc cggggacgcg tgcatttatc agacccaaaa cccatgcggg gtgctcctca cggggtgccc cggccgcttt ggtgactcta gataacctcg agctgatcgc tggccctcgt ggcggcgacg tctcattcga atgtctgccc tatcaacttt cgatggtact ttttgtgcct accatggtga ccacgggtaa cggggaatca gggttcgatt ccggagaggg agcctgagaa acggctacca catccaagga aggcagcagg cgcgcaaatt acccactccc gactcgggga ggtagtgacg aaaaataaca atacaggact ctttcgaggc cctgtaattg gaatgagtac actttaaatc ctttaacgag gatccattgg agggcaagtc tggtgccagc agccgcggta attccagctc caatagcgta tcttaaagtt gcctcntgcc naatcctgca gccgggggat cc AW013183 cgtccgtctt gggccggtcc aagaatttca cctctagcgg cacaatacga atgcccccgg SEQ ID NO: 146 ccgtccctct taatcatggc cccagttcag agaagaaaac ccacaaaata gaaccggagt cctattccat tattcctagc tgcggtattc aggcgaccgg gcctgctttg aacactctaa ttttttcaaa gtaaacgctt cggaccccgc gggacactca gctaagagca tcgagggggc gccgagaggc aggggctggg acagacggta gctcgcctcg cggcggaccg tcagctcgat cccgaggtcc aactacgagc tttttaactg cagcaacttt aagatacgct attggagctg gaattaccgc ggctgctggc accagacttg ccctccaatg gatcctcgtt aaaggattta aagtgtactc attccaatta cagggcctcg aaagagtcct gtattgttat ttttcgtcac tacctccccg agtcgggagt gggtaatttg cgcgcctgct gccttccttg ga AF061256 gggcggcgac ggtttcctgg tggccgcgcg ctgctctgtg agcggcgggt ggcagacgga SEQ ID NO: 147 cctgggccct caccccagac gcaccgcgga tctggcatgg ctcacctgat gacaatgcag ttgctgctcc tgctgatatg ggtatctgag tgtgcccaat caagagctac tcgggccaga actgaactgc tcaatgtttg catggatgca aagcaccaca aagaaaagcc aggccctgag gacaatttac acaaccagtg cagtccctgg aagaagaatt cctgctgttc caccaacaca agccaggaag cccacgagga catttcctac ctgtaccgat tcaactggga ccactgtgga aagatgacat tggaatgcaa gcgacacttt atccaggata cctgtctcta tgagtgttct cctaacttgg gaccctggat tcagcaggtg gaccagagct ggcgaaaaga gcgaatcctt gatgttcctc tgtgcaaaga ggactgtcag cgatggtggg aggactgccg cacctctttc acctgcaaga gcaactggca caaggggtgg aactggacct cggggtataa ccagtgccct gtgggagcct cctgtcgcca cttcgacttc tatttcccta cacctgctgc tctgtgtgag gaaatctgga gtcactccta caaactcagt aactacagcc gagggagtgg ccgctgtatc cagatgtggt tcgacccagc ccaaggcaac cccaacgagg aagtggcaag gttctatgct gaggccatga gtggagctgg gcttcacggg gcctggccac taatgtgcag cctgtcttta gtgctgctct gggtgttcag ccgagttcct ttaaccttct gatccccagg aactccctgc cgggcttaga ctcccagctc ccaacctcct ttgtggtggg gcctctgaca ggcattcaat atctctctta tgaattattt gggtgtgaat gggaatataa ttattttgca tcctacttac cactgattga agttgtttaa acttggttag ttccctgctc taacacttac tgtgggcaag ttaaataaac ttaattttcc tgtgctgttc cacaaaaaaa aaaaaaaaaa aaaaa A1956572 atcggacgcc ccccgtgtcg gtgacgaccc attcgaacgt ctgccctatc aactttcgat SEQ ID NO: 148 ggtagtcgct A1882550 gtggacgaag actgtagaga ctacccagag tctgacctag ggagaggcca actcggatac SEQ ID NO: 149 ccctatgtgc gctcccagaa gctaaggaca ttgagacaga aagacatggc ctggaaacag acaccactct tgcttttggt ctacatggtc acaacaggca gtggcgggac agaacagacc tactcaacgt ttgcatggat gccaaacacc ataagacaaa gccgggcccc gaggacaagc tgcatgacca gtgtagtcca tggaagaaaa atgcctgttg ctcagtcaac accagccagg agctacacaa ggctgactcc cgtctgtact tcaactggga tcactgtggc aagatggagc ctgcctgtaa gagtcacttc atccaagact cctgcctgta tgagtgctcc ccaaccttgg ccttggatca gcaagtggac agagttggcg taagagcgtt ctggatgtgc A1822932 gagaattagg gttcgattcc ggagagggag cctgagaaac ggctaccaca tccaaggaag SEQ ID NO: 150 gcagcaggcg cgcaaattac ccaatcctga cacggggagg tagtgacaat aaataacaat accgggctct tcgagtctgg taattggaat gagtacaatc taaatccctt aacgaggatc cattggaggg caagtctggt gccagcagcc gcggtaattc cagctccaat agcgtatatt taagttgttg cagttaaaaa gctcgtagtt gctgtcttta ggggactctc actctcctgc ttgtcgttgt gttcttaagg tcttgtcttt attgccggtt gatgtactgc tagtcgtaat tgctctcatt tgccctgtcg tttccgt A1785988 cgccgtgcct accatggtga cc SEQ ID NO: 151 A1744273 gggccccgcg ggacactcag ctaaaagcat cgagggggcgc cgaga SEQ ID NO: 152 A1727302 ctaaatccct taacgaggat ccattggagg gcaagtctggt gccagcagc cgcggtaatt SEQ ID NO: 153 ccagctccaa tagcgtatat ttaagttgtt gcagttaaaaa gctcgtagt tggacttagg ggtgggtcgg ccggtccgcc tcacggtgag caccggtctgc tcgtcccta ctgccggcga tgcgctcctg gccttaattg gccgggtcgt tcctccggcgc tgttacttt gaagaaatta gagtgctcaa agcaggccta cgcttgtata cattagcatgg gataacatc ataggatttc gatcctattg tgttggcctt cgggatcgga gtaatgattaa cagggacag tcgggggcat tcgtatttca tagtcagagg tgaaattctt ggatttatgaa agacgaaca actgcgatag catttgccaa ggatgttttc attaatcaag aacgaaagttg ggggctcga aaacgatcag ataccgtcct agtctcaacc ataaatctcc tccagttccgg aaccacatc ctccgccagt tccagtctat aagaaaacac atccnactcc agttccagtat acaagatac catgtcctcc ccagttccag tctataaatc tcctccggtt ccatt A1725714 ggataaccgt agtaattcta gagctaatac gtgcaacaaac cccgacttc tggaagggat SEQ ID NO: 154 gcatttatta aataaaaggt cgacgcgggc tttgcccgttg ctctgatga ttcatgataa ctcgacggat cgcacggcct ttgtgccggc gacgcatcatt caaatttct gccctatcaa ctttcgatgg taggatagtg gcctactatg gtggtgacggg tgacggaga attagggttc gattccggag agggagcctg agaaacggct accacatccaa ggaaggcag caggcgcgca aattacccaa tcctgacacg gggaggtatt gacaataaata acaataccg ggctctatga gtctggtaat tggaatgagt acaatctana tcccttaacga agatccatt ggagggcaat tctggtgcca ncanccgcgg taattccact cccatancgta tatttaagt gtttgcagtc aaaaagctcg taattggact taggggtggg tcngccggtcc ccctcacgg tgagcacggg tctgctcttc cctactgcgg gcgatgccct cctggccttaa ttg AF137375 ggattcctgc tgcttttgac cacagttctt tctgcaggaca agcatggcc cttgggagag SEQ ID NO: 155 cacgg AF137374 gatgagggag tccaggagtt ccagcaagct cgacctgctta acactccca gacggtcaca SEQ ID NO: 156 ggattcagga caagcatggc ccttgggaga gcacggctgct gctgctctt ggtgtgtgtg gctgtcacat gggcggcccg gcctgatctc ctcaacatctg catggacgc caagcaccac aagaccaagc ccggcccgga agatggcctg catgagcagtg cagcccctg ggagatgaac gcctgctgct ccgtcaacac cagccaagaa gcccataacga catctccta cctgtacaaa ttcaactggg agcactgcgg caagatgaag ccggcctgcaa gcgccactt cattcaagac acctgtctct atgagtgctc gcccaacctg gggccctggat ccaggaggt gaaccagaag tggcgcagag agcggatcct gaacgtgccc ctctgcaaaga ggactgtca gaactggtgg gaagactgcc gcacctccta cacctgcaag agcaactggca cgagggctg gaactggagc tcagggtata accggtgccc cgcgaacgcc gcctgccaccc cttcgactt ctacttcccc acgcctgctg ccctgtgcag ccagatctgg agcaactccta caaacaaag caactacagc cggggcagcg gccgctgcat ccagatgtgg ttcgacccgga acagggcaa ccccaacgag gtggtggcga gatactacgc ccagatcatg agtggcgctgg gctctccga ggcctggcct ctccagttcg gcctggccct gacgctgctc tggctgctgag ctgagcttc tgtcttcgga gagctggaca gccctcccct gttcggcccc acagcacccag ctcgtcagt gcctcagtgg tggtggtagt ggtggtggtg gtggcggcgg ggggactctga ataaaccag tcaccccac AF137373 gacactgctt ccgggtgggc ctccaggagg gccgaggcag aggagcctct gcctgtgggt SEQ ID NO: 157 gaagcactgg ctggcgaact ccggaagggg aggtccggag aggtggtgcc tccccccgca gcaaagctca gactgcactg tcctcaggtg gcagtggtgt cctaccactt ggcacagacc tccacgggcc cttcatcgct tggctccact gtgctgtggg gtaagcggcg cggggaggga cgacgatctg ggcttggaag ggaaacagga aatctggcca agaagcttac ggcagctttc tggcagaagt ggatcaacat ggcctggcgg ctgacgctct tcgtgctcct gggtttggtg gctgctgtgg ggggcgcccg ggccaagtcg gacatgctca atgtctgcat ggatgccaag caccacaagc caaagccaag cccggaggac aagctgcacg accagtgcag cccctggagg aagaactcct gctgctcagt caacaccagc ctagaagccc ataaagacat ctcctacctg tacagattca actgggacca ctgcggcaag atggagccgg cctgcaagcg ccqcttcatt caagacacct gtctctatga gtgctcgccc aacctggggc cctggatcca ggaggtgaac  cagaagtggc gcagagagcg gatcctgaac gtgcccctct gcaaagagga ctgtcagatc tggtgggaag actgccgtac ctcctacacc tgcaagagca actggcacaa gggctggaac tggacctcag ggtataacca gtgcccagtg agcgccgcct gccaccgctt cgacttctac ttccccacgc ccgctgccct gtgcaacgag atctggagcc actcctttga agtcagcagc tacagccggg gcagcggccg ctgcatccag atgtggttcg acccggccca gggcaacccc aacgaggcgg tggcgagata ctatgcagag aatggggatg ctggggccgt ggcccagggg atcgggcctc tcctgaccaa cttgacggag atggtgaaac actgggtcac cggctaagct gttcccccgc cgacccctgc tttccgccca caccccctgg gttactctcc gggtggcctc agcaccccgg tcattggctc ctgatctaag atccgatggg gagcctctga tggcctcttc caatacaata tccacgtg AF096320 ctcagtcgca catagataaa attggccttt atttggagac gggtttgttc ttctatgttt SEQ ID NO: 158 aatcctcggg tgaaatgacc tgaagatatt tgtgtctgtt ttccgcatgg tcaagcaggg agtggagaga ggcctgggct gggccaggtt ttctgggctt tttcctgtgc tccgagtagg tgggttgtat tttacccagt aggagtggaa gactccttgg cgcttggtgc ttcaaccgga ctgacttcct gggcctggag ttggcgatta gaggtctgac atggctcacc tgatgactgt gcagttgttg ctcctggtga tgtggatggc cgaatgtgct cagtccagag ctactcgggc caggactgaa cttctcaatg tctgcatgga tgccaaacac cacaaa AF096319 gctgacggct ctagaagtcc ccaacctgtt gtgatcttca gtagacaaac actcctggtg SEQ ID NO: 159 tgtcacagga ttcaggccac taaacctcgg ccggctgtct cctggaatga agaaagcaaa ggaagcctag agtggagaca aagaagcccg aggcactctg agagctgcca tcttttcctt gtttgccgcc tgacacttct cagcaggatc cacatcccct aagggagtgg agagaggcct gggctgggcc aggttttctg ggctttttcc tgtgctccga gtaggtgggt tgtattttac ccagtaggag tggaagactc cttggcgctt ggtgcttcaa ccggactgac ttcctgggcc tggagttggc gattagaggt ctgacatggc tcacctgatg actgtgcagt tgttgctcct ggtgatgtgg atggccgaat gtgctcagtc cagagctact cgggccagga ctgaacttct caatgtctgc atggatgcca aacaccacaa agaaaaaccg ggccctgagg acaatttaca cgaccagtgc agcccctgga agacgaattc ctgctgttcc acgaacacaa gccaggaagc acataaggac atttcctacc tgtaccggtt caactggaac cactgcggaa ctatgacatc ggaatgcaaa cggcacttta tccaagacac ctgcctctat gagtgttccc cgaacttggg accctggatc cagcaggtgg accagagctg gcgcaaagag cggatccttg atgttcccct gtgcaaagag gactgtcagc agtggtggga ggactgccag agctctttta cctgcaagag caattggcac aagggatgga actggtcctc ggggcataac gagtgtcctg tgggagcctc ctgccatccc ttcaccttct acttccccac atctgctgct ctgtgtgagg aaatctggag tcactcctac aagctcagca actacagtcg agggagcggc cgctgcattc agatgtggtt cgacccagcc cagggcaacc ccaacgagga agtggcgagg ttctatgccg aggccatgag tggagctggg tttcatggga cctggccact cttgtgcagc ctgtccttag tgctgctctg ggtgatcagc tgagctcctg ttttaccttc agttgtctgg agcgccaccc tgcttggctc agcctcccag ctcccagcct cctttgtggt ggggctctga cagcctcttt aataaaccag acattccaca tgtgccttat gaattaaaaa aaaaaaaaaa aaa A1663857 cccgttaaag gatttaaagt ggacctcatc caattacagg gccttgaaag aatcctgtat SEQ ID NO: 160 tgttatattt A1647841 ataaggcaca tgtggaatgt ctggttgatt aaagaggctg tcagagcccc accacaaagg SEQ ID NO: 161 aggctgggag ctgggaggct gagccaagca gggtggcgct ccagacaact gaaggtaaaa caggagctca gctgatcacc cagagcagca ctaaggacag gctgcacaag agtggccagg tcccatgaaa cccagctcca ctcatggcct cggcatagaa cctcgccact tcctcgttgg ggttgccctg ggctgggtcg aaccacatct gaatgcagcg gccgctccct cgactgtagt tgctgagctt gtaggagtga ctccagattt cctcacacag agcagcagat gtggggaagt agaaggtgaa gggatggcag gaggctccca caggacactc gttatgcccc gaggaccagt tccatccctt gtgccaattg ctcttgcagg taaaagagct ctggcagtcc tcccaccact gctgacagtc ctctttgcac aggggaacat caaggatccg ctctttgcgc cagctctggt ccacctgctg gatccagggt cccaagttcg gggaacactc atagaggcag gtgtcttgga taaagtgccg tttgcattcc gatgtcatag tttcgcaggg ttccagttga accggtacag gtaggaaatg tccctatgtg cttcctggct ttgtgtcgtg aacagcagga atcgtcttnc aggggctgcc actgtcgtgt aaattgcctc angggcccgt tttttctttg tgtggtgcat ncatgcagac aatttgaaat cagtcctggc cgagtagctc tg A1646950 aaggcctggt aattaaaaag gctgcaaagc cccacccaaa ggaggttggg agctgggagg SEQ ID NO: 162 ttgacccaac cagggtggcc ctccaaacaa ctgaaggtaa aacaggagct cagttgatca cccaaagcag cattaaggac aggcttgcca aaagtggcca ggtcccatga aacccagttc cattc A1607910 gtagttgctg agcttgtagg agtgactcca gatttcctca cacagagcag cagatgtggg SEQ ID NO: 163 gaagtagaag gtgaagggat ggcaggaggc tcccacagga cactcgttat gccccgagga ccagttccat cccttgtgcc aattgctctt gcaggtaaaa gagctctggc agtcctccca ccactgctga cagtcctctt tgcacagggg aacatcaagg atccgctctt tgcgccagct ctggtccacc tgctggatcc agggtcccaa gttcggggaa cactcataga ggcaggtgtc ttggataaag tgccgtttgc attccgatgt catagttccg cagtggt A1529173 taccacaacc aaagaaagca ttacacgcgc atattaccca ctg SEQ ID NO: 164 A1509734 gagagttgaa cttgccaccc acttcaggga tctctggtac cacaaggtct tgtttctctc SEQ ID NO: 165 tctctcttgg aggcaggcta ctcaggtcta gctactggcg gctctccaca cctgtagctc atagaagctg aaggctgata aaagggcagt gggtggagcg ccctcagccc gctcacctct ttggcatcag gaggagcaac aggagggccc tgccttgaag gtcatggcac agtggtggca gatcctcttg gggttgtggg cagtcctacc caccttggca ggggacaaac tgctcagcgt ctgcatgaat tccaagcgcc acaagcaaga acctggccca gaagacgaac tctaccagga gtgcaggcct tgggaggaca atgcctgctg cacacgttcc acaagttggg aagcccacct tgaggagccc ttgctcttta acttcagcat gatgcactgt ggactgctga ccccggcctg tcgcaaagca ctcattccag nccatttgtt tccatgatgt tcccccaacc tggggccctg gatcccaccc gtgtcc A1506267 acacagagca gcagatgtgg ggaagtagaa ggtgaaggga tggcaggagg ctcccacagg SEQ ID NO: 166 acactcgtta tgccccgagg accagttcca tcccttgtgc caattgctct tgcaggtaaa agagctctgg cagtcctccc accactgctg acagtcctct ttgcacaggg gaacatcaag gatccgctct ttgcgccagc tctggtccac ctgctggatc cagggtccca aagttcggga acactcatag aggcaggtgt cttggataag tgccgttgca ttccgatgtc atagttccgc agtggttcag ttgacccgta cggtaggaat gtcctatgtg cttctggctg tgt A1498269 cccccggggc cggaaggggg aaatttgccc cccggcgccc ttcctgggag ggggaacc SEQ ID NO: 167 A1000444 cggcgaacac catcgaaagt taatagggca gacgttcaaa taggtcgtc SEQ ID NO: 168 AA956337 cggccgctcc ctcggctgta gttgctgagc ttgtaggaat gactccagat tttctcacac SEQ ID NO: 169 agaacagcag gtgtagggaa gtagaaagtg aagggatggc aggaggctcc cacagggcac tcattatgcc ccgaggtcca gttccatccc ttgtgccagt tgctcttgca ggtaaaagag ctcttgcagt cctcccacca cagcacacag tcttctttgc acaggggaac atcaaggatc cgctctttgc gccagctctg gtccacctgc tggatccagg gtcccaagtt cggggaacac tcatagaggc aagtgtcttg gataaagtga cgtttgcact ccggggtcat agttccacag tgattccagt tgaatcggta caggtangaa atgtccttat gtgcttggca tccatgcaga cattgagaag ttcgcctcgt gccgaatt AA955042 tttttttttt tttttttaga tgtgtcatag atttattgaa aatgattgtt gtagaaaccc SEQ ID NO: 170 caagccccaa catggatcag gaactggatg gcagaggaga ggggtggaag caggctcaag acttccttat tctttgaaga gttgaaccaa ccgagaccaa gggtagctag aggagaagac tggcctcagt cagggagcca taatgacagc actggggcca ggctgggcac aagaagtatt gctgcatggt acacagtccc agatgtcata aaggaagcat agaacttcac cacttcctca ttgggattgc cctgggttga gtcaaaccac atctggatgc actggccact ccctctgcta tagttgctga ccttgtanga gtgactccag agaccctcac aaaggctggc tggtgtcggg aaatagtact gaaat AA899838 cggccgctcc ctcggctgta gttgctgagc ttgtaggagt gactccagat tttctcacac SEQ ID NO: 171 agaacagcag gtgtagggaa gtagaaagtg aagggatggc aggaggctcc cacagggcac tcattatgcc ccgaggtcca gttccatccc ttgtgccagt tgctcttgca ggtaaaagag ctcttgcagt cctcccacca cagcacacag tcttctttgc acaggggaac atcaaggatc cgctctttgc gccagctctg gtccacctgc tggatccagg gtcccaagtt cggggaacac tcatagaggc aggtgtcttg gataaagtga cgtttgcact ccggggtcat agttccacag tgattccagt tgaatcggta caggtangaa atgtccttat gtgcttcctg gcttgtgttg gtggagcag AA899718 cggccactcc ctcggctgta gttgctgagc ttgtaggagt gactccagat tttctcacac SEQ ID NO: 172 agaacagcag gtgtagggaa gtagaaagtg aagggatggc aggaggctcc cacagggcac tcattatgcc ccgaggtcca gttccatccc ttgtgccagt tgctcttgca ggtaaaagag ctcttgcagt cctcccacca cagcacacag tcttctttgc acaggggaac atcaaggatc cgctctttgc gccagctctg gtccacctgc tggatccagg gtcccaagtt cggggaacac tcatagaggc aggtgtcttg gataaagtga cgtttgcact ccggggtcat agttccacag tgattccagt tgaatcggta caggtaggaa atgtccttat gtgcttcctg gcttgtgttg gtggagc AA858756 cggccgctcc ctcggctgta gttgctgagc ttgtaggagt gactccagat tttctcacac SEQ ID NO: 173 agaacagcag gtgtagggaa gtagaaagtg aagggatggc aggaggctcc cacagggcac tcattatgcc ccgaggtcca gttccatccc ttgtgccagt tgctcttgca ggtaaaagag ctcttgcagt cctcccacca cagcacacag tcttctttgc acaggggaac atcaaggatc cgctctttgc gccagctctg gtccacctgc tggatccagg gtcccaagtt cggggaacac tcatagaggc aggtgtcttg gataaagtga cgtttgcact ccggggtcat agttccacag tgattccagt tgaatcggta ca A1311561 aaggcccggg aactcccatc aaaagttgtt agggcaaact ttcaaatggg tc SEQ ID NO: 174 A1385951 aattcggatc catgggctga tctggaagta taaacaagaa aggaggctga cggctctaga SEQ ID NO: 175 agtccccaac ctgttgtgat cttcagtata caaacactcc tggtgtgtca caggattcag ctctgtttcc taggccacta aacctcggcc ggctgtctcc tggaatgaag aaagcaaagg aagcctatag tggagacaaa gaagcccgag gcactctgag agctgccatc ttttccttgt ttgccgcctg acacttctca gcaggatcca cataccctaa ggagtggaag actccttggc gcttggtgct tcaaccggac tgacttcctg tgcctggagt tggcgattag actctgcctt cagggtctga catggctcac ctgatgactg tgcagttgtt gctcctggtg atgtggatgg ccgaatgtgc tcagtccata gctactcggg ccaggactga acttctcaat gtctgcatgg atgcctaaca ccacaaagat aaaccgtccc tgaggacatn tacacgacca gtgcagcccc tgcaagacaa ttactgctgt tccactaaca caagccagga agcacataat gacatttcct acctgtaccg tttcactgga accactgctg aactatgaca tcggaatgca tacggcacta tatccaagac acttgctcta tgagtgttcc cccgacttgt gaccctgtat tcagcangtg gaacatgact tgcgcatata cggatccttg atgttcccct gtgcaaagag gactgtcagc attgatgtga tgactgccat agctctttac ctgtcagaac atttgtccat ggtatgtaac tgttcct A1352406 gggtcatttc cacatgcttt attccagcaa tcaaaataat taaaaacatc tcaaattatt SEQ ID NO: 176 atacacatac aaaataggta cagagtcttt tgcttcctcc cacccctagg gggaaaaact gctttgtgct ttgggaagtt gtctctgaaa cccggggaca gaggacgcag gacagactag gagggagccg ggaggatggg ctgcagctgt ggaggagggt ttcagaggag agaggtcgga gagcagaggc ctgagaagcc tgattccccg tcacccgtgg tcaccatggt aggcacggca actaccatcg aaagttgatg ggcaga AF100161 actagttgtc tgttgctgca taacaaatca ttccataatt ttgtggtgta ttgctgcaga SEQ ID NO: 177 caatgttaaa ctaagtggat gaaaaggata ttcacatagt ctcagagtgt ctccctacaa ggtaggatta ctaacaaagg gaaactaata attatatagt aaggaaatct ccttaaccca ataatcacca gcaataagat gcagcaaccc tcatcatgta cctcttgata tgatgcactg acaaaagcac ctctcttctc taattttctt gccaaaatcg ataagctcaa gctaattaca ggaaaatata gacaaaccca aattgaggga cattctgcaa aataactgaa cagtaattct ccaaaagtgt caaggtcata aaagacaaag acattgagga ctgtcacaga ttggagggag actaagggga catgacaact acatgcaacc tggaatcatg gactgaatcc tgggccagag aaggacattg ggggggaact ggtgtaaagg gcataaagct tgtagattag ttaacagtat tgcctcaata ttaatttcct gattttttta agaactgggc tttggttaca taagatgcca atatttgggg aagttgcata aaaacatacg ggaaatcttt tgacgatgtt ttgcagtttt tctgcaaatc taaaattatt tcaaaacaaa aagtttaaaa atcaaataca catagttgct tgaaatagta actattttat tatattccaa gatgttgtga gtcaggaatt tggccaaaac tcaggtgggc gattcttctg caaagacccc cacaacacat tcaaagtcac aggcagaggt tgttggggga gggcattgaa aagaagagaa gagtcatagg tgggtgcaat ggagggaggg cagagggctg ctgactatgt gcaggactca tccataatgg agccctgggg aggcaagggc ttcataacta gacactggtc ttgtcacctc agactcacct gtagcaggac cagatactga ggtcagactg aaaacacagg ctctgcctca ggagaggctc tctactagct gagtaaatga tgacagtatt ggaaatgttc ccaacatcat aatgggaaaa catcacttca cactacataa gcaatacaca ggggcagtgc cggtcgtctt cccaggttag tagcagttct actgcctcca agagtgttgg agaaatacaa accaagcatt aggcactttt aacttgaaaa catgaagttc tctttcctaa ctttctttgt ttccttattt cttcttcttc ttcttcttct tcttcttctt cttcttcttc ttcttcttct tcttcttctt cttcttcttc ttcctcttct tcttcttctt cttctttctt cttcttctcc ttctcctttt ccttcttctt tttttgctga gacagggtct cactctgaca gtacagtggt gccatcacag ctcactgcag cctcgacctc cagggctcaa gcaatcctcc cagctcaccc tcccaaatgg ctgaaactac aagctcgcac caccatacgt ggctaatttt tctatttttg tgtgcagatg aagttttcct atgttgccca agtggtctca aactcctggg atcaagtgat ccatccacct caacctccca aaacgctggg attacaggtg taagccacca cacccagccc actaactttt ttatatcggc taatgaaata gttttaagtt tagaccctac gaggcataaa gaaataattt tagttatgtt atcagatgta cagtaatact caagtgtgca actgtggata acttgagttc atgaggtttt tgtttttttg tcaaaagaat aaatttatag tgaaactacc caaaaaagca aagtacagaa cagtatgcta ccatttgtgc acagaaatgg gatatatatg gtgtaactgc atcgaattta ctggatgtat gtccagggac cagaactctt ggtggcttca tgttcatact tttgcaagca catgtgtagt atccttaact taaaggtact gttgtataca ttctagtgtt atcaaaattt acatacatat tatcaagtca gagaggtcat tctgtgtctt agtattttca cttcatattt ggtatattta tgtatgtata cacacatacc tatatgtatt taaataagat ttatagtcac atggtccaaa aatcaaaaca atgtggaaag gtttacagag aaaagtctca agcctaatcc tgttctctac tgccaggtga ccatgttatt aatttctttt cataccttgc cacagaattt tcacctgcaa acacagatat tcttttcttt tttaatgaca gagtcacgtt ctgttatcca ggctggagtg cagtggcgtg atcttggctc actgcaaact cctcccgggt tcaagtgatt ctcctgtctc agcctcctga gtagctggga ttacaggcat gtgccaccac acccagctaa tttttgtatt tttagtacag atggggtttt atcatattga ccaggctgat gtcgaactcc tgacctcaag tgatccgcct gcctcggcct cccacagtgc tgggattaca ggcgtgagcc accacgccca gtcaacacag acattcttac tcctttttta cagagaattt attattatta ttttttacat agcatttttc tgcacctttc tttttccact taacaatgca cttgaagatt tttccatatt tgtacatcag gagctttctc tttctttgtt accacattaa attccactgg gtagatgtac cataatttaa ctgggtcctt attgaaagac aattgagctg tctcctagac aaagccttgt gcaccttccc gaacagaggg tctaaccaag caggcaggat ggggttataa agtaggtggg gaggtgggag agactccacc ttcccaggtg ggctgagaat ggaggtaagg ccctgcaaca ggacagaggg aaaagtgggg atgagaggtg ggaggcgaga tagcgcccac tgttctcgct cagccccctc ctccgtttgc cgctgacctg ttggcctccc ccaacctctg agcctgcctc tgcctaggta atttcccaag acccagaagg ggtgaagggt gaggtgtgat tgcccccacc tccttgcctc ccgcagcatc tgctccggga ccatgaacaa tagctgacag ctccatggcc cttgctgtcc ccatctcagc ttccctgggc at4Saaacct cagctgccat ggggtaggag gacaggctga ggaagcagaa gcctgaggct gtctagagtc tcactcctgc atcagcaggc caccacctgt ggttcctcct tgtgcaaatt tgaaaagaat tgcataaaac actggagaaa tccaagaggg gaagtccaca agggcggtgg ctccctacaa ggtcacagag caagctggtg tcagagcctg gacctacagc gctgttggtg gaggtcctgc ctccaggtag gggaagggct ccctctcacc tctacacgca gcgcatttct tggctcagct gccctgtagg ggatgcaggg tggggacagc A1326503 aattcggatc catgggctga tctggaagta taaacaagaa aggaggctga cggctctaga SEQ ID NO: 178  agtccccaac ctgttgtgat cttcagtaga caaacactcc tggtgtgtca caggattcag ctctgtttcc taggccacta aacctcggcc ggctgtctcc tggaatgaag aaagcaaagg aagcctagag tggagacaaa gaagcccgag gcactctgag agctgccatc ttttccttgt ttgccgcctg acacttctca gcaggatcca cataccctaa ggagtggaag actccttggc gcttggtgct tcaaccggac tgacttcctg ggcctggagt tggcgattag actctgcctt cagggtctga catggctcac ctgatgactg tgcagttgtt gctcctggtg atgtggatgg ccgaatgtgc tcagtccaga gctactcggg cccagactga acctctcatg tctgatggat gccaaacacc acatagaata accgggccct gaggacaatt tacacgacca gtgcagcccc tggaagacga aatcctgctg ttccacgaac acaagccagg aagcacataa ggacatttcc tacctgtacc ggttcaactg gaaccactgc ggaactatga catcngcaat gcanacggca ctttatccaa gacacctgcc tctatgagtg ttccccgaac ttgggacact gnatccagca ggtgggacca aagcttgcgc caaagagcgg atcccttgat gtttcccctg ggcaaagagg actgtccagc agttgtgggg aggactgcca gaagctcttt tacctgccag agcaatttgc accaggg A1325517 gtagttgctg agcttgtagg agtgactcca gatttcctca cacatagcag cagatgtggg SEQ ID NO: 179  gaagtagaag gtgaagggat ggcaggaggc tcccacagga cactcgttat gccccgagga ccagttccat cccttgtgcc aattgctctt gcaggtaaaa gagctctggc agtcctccca ccactgctga cagtcctctt tgcacagggg aacatcaagg atccgctctt tgcgccagct ctggtccacc tgctggatcc agggtcccaa gttcggggaa cactcataga ggcaggtgtc ttggataaag tgccgtttgc attccgatgt catagttccg cagtggttcc agttgaaccg gtacaggtag gaaatgtcct tatgtgcttc ctggcttgtg ttcgtggaac agcaggaatt cgtcttccag gggctgcact ggtcgtgtaa attgtcctca gggcccggtt tttctttgtg gtgtttggca tccatgcaga cattgagaag ttcagtcctg gcccgagtag ctctggactg a A1325453 acacagtagt tttcagatgt ggggaagtag aaggtgaagg gagggcagga tgctcccaca SEQ ID NO: 180 ggacactcgt tatgccccga ggaccagttc catcccttgt gccaattgct cttgcaggta aaagagctct ggcagtcctc ccaccactgc tgacagtcct ctttgcacag gggaacatca aggatccgct ctttgcgcca gctctggtcc acctgctgga tccagggtcc caagttcggg gaacactcat agaggcaggt gtcttggata aagtgccgtt tgcattccga tgtcatagtt ccgcagtggt tccagttgaa ccggtacagg taggaaatgt ccttatgtgc ttcctggctt gtgttcgtgg aacagcagga attcgtcttc caggggctgc actggtcgtg taaattgtcc tcagggcccg gtttttcttt gtggtgtttg gcatccatgc agacattgag aagttcagtc ctggcccgag tagctctgga ctgagcacat tcggccatcc acatc A1325382 gtggacgaag actgtagaga ctacccagag tctgacctag ggagaggcca actcggatac SEQ ID NO: 181 ccctatgtgc gctcccagaa gctaaggaca ttgagacaga aagacatggc ctggaaacag acaccactct tgcttttggt ctacatggtc acaacaggca gtggccggga cagaacagac ctactcaacg tttgcatgga tgccaaacac cataagacaa agccgggccc cgaggacaag ctgcatgacc agtgtagtcc atggaagaaa aatgcctgtt gctcagtcaa caccagccag gagctacaca aggctgactc ccgtctgtac ttcaactggg atcactgtgg caagatggag cctgcctgta agagtcactt catccaagac tcctgcctgt atgagtgctc ccccaacctt gggccttgga tccagcaagt ggaccagagt tggcgtaaag agcgttttct ggatgtgccc ctatgcaaag aggactgtca ccagtggtgg gaagcctgtc gtacctcctt taccntgcag agagactggc atanaggctg ggactggtcc tcaggcatta acaagtgccc anacacagca ccctgtcaca cgtntgagta ctacttcccg acaccagcca gcctttgcga gggtctctgg agtcactcct acaaggtcag caaactacag cagaggagtg gccgctgcat ccagatgtgg ttgactcacc ccanngcann tcgaaatgag acgtggtgaa gtttatgctt ctttatacat ctgggatgtg cccatgcaca gtact A1323700 acacagagca gcagatgtgg ggaagtagaa ggtgaaggga tggcaggagg ctcccacagg SEQ ID NO: 182 acactcgtta tgccccgagg accagttcca tcccttgtgc caattgctct tgcaggtaaa agagctctgg cagtcctccc accactgctg acagtcctct ttgcacaggg gaacatcaag gatccgctct ttgcgccagc tctggtccac ctgctggatc cagggtccca agttcgggga acactcatag aggcaggtgt cttggataaa gtgccgtttg cattccgatg tcatagttcc gcagtggttc cagttgaacc ggtacaggta ggaaatgtcc ttatgtgctt cctggcttgt gttcgtggaa cagcaagaat tcgtcttcca ggggctgcac tggtcgtgta aattgtgctc atggccctgg tcttctttag tgtgtttagc atccatgcag acatcgagaa gatcagtcct ggtccgagta gctctggact gagcacagtc ngncattcac atcatccaga gcaacaactg cacagtcatc aggtgagcca tgtcagaccc tgatgcagag tctaa A1323374 tgggtcataa attgattgaa aatgattgta gaaaccccaa cccccaacat ggatcaggaa SEQ ID NO: 183 ctggatgaca gaggagaggg gtggaagcag gctcaaaact tccttattct ttgaagagtt gaaccagacc aagggtagat agaggagaaa tctaaagagg aagactgacc tctcagccag ggagccataa tgacagcact ggggccaggc tgggcacaag aagtactgct gcatggggca cagtcccaga tgtcataaag gaagcataaa acttcaccac gtcctcattc ggattgccct gggttgagtc aaaccacatc tggatgcagc ggccactccc tctgctgtag ttgctgacct tgtaggagtg actccagaga ccctcgcaaa ggctggcttg tgtcgggaag tagtactcaa acgtgtgaca gggtgctgtt gttgggcacc ttgttaatgc ctgaggacca gtcccagcct tattgcaatc tttcttgcag gtaaaggagg acgacaggct tccaccactg gtgcagtcct ctttgataag ggacatncag aaacgctctt acgccactct ggtc A1313973 ctatccattc gaacgtgtgc catatcatct tctgatgtac caacccgtgc ctaccatgtg SEQ ID NO: 184 gaccacgggt gactggcaat ccaga A1196928 attccccgnc ccccggggtc accaggggag gcgcggggac taccattaaa agttgatagg SEQ ID NO: 185 gcaaactttt AF091041 gaactagggc ggtatctaat cgccttcgaa cctctaactt tcgttcttga ttgatgaaaa SEQ ID NO: 186 cacctttggc aaatgctttc gctgatgttc gtcttgcgac gatccaagaa tttcacctct aacgtcgcaa tacgaatgcc cccagttatc cctattaatc attacctcgg agttctgaaa accaacnaaa tagaaccgag atcatattct attattccat gcacgaaata ttcaagcagc attttgagcc cgctttgagc actctaattt gttcaaagna aaattgtcgg cccatctcga cactcaccga agagcaccgc gataggattt tgatattgaa ccgacgtttg ttacaacgcc ggctcaccga cnatatgctc cgcagacgtg tcagtatcac cgcggatgcg gtgcaccgac agcncggcgc acaaatgcan ctacnagctt tttaaccgca acaattttag tatacgctat tggagctggg aattaccgcg gctgctggca ccagacttgc cctcaattgn cctcgttaaa atatttaaag tgtctcattc cgattacgaa gcctcg A1156212 cagcgagcct ttgcgggggt gtctggagtg actcctacga ggtgagcgac tacagcagag SEQ ID NO: 187 ggagtggccg ctgcgtccag atgtggtttg agtcagccca gggcgatccc aatgaggacg tggtggagtt ttatgcttcc tttatgacat ctgngactgt gccccatgca gcagtagttc ttgtgcccag cctnngccca gtgctgtcat tatagctccc tggctgagag gtcagtgttc ctctctagat ttcgtcctct atctaccctt ggtgctggtt cagctcttca gagaa A1120374 cagctcacct cctgttttac cttcacttct ctccacgccc caccctcgct tcgcgctcac SEQ ID NO: 188 gcctcccagc tcccacgcct ccttt A1119000 cctcccggct cctgcccgag ggtcgggcgc ctgcggcttt ggtgacttta gattacctcg SEQ ID NO: 189 ggccgatcgc acgccccccg tggcggcg AA408670 gtctctctct ctcttctctt gcttcgctct cttgcttttc tctctctctt gctttttcgc SEQ ID NO: 190 tctcttgctt ctcgctctct cttgcttctt gcnctctttt cctgaagatg taagaataaa gctttgccgc agaagattct ggtctgtggt gttcttcctg gccggtcgtg anaacgcgtc taataacaat tggtgccgaa ttccgggang anaaaatccg ggacgagaaa aaaactccgg antggcgcag gagggatact tcattccagg aancagaact gcgaatcaag gttanaangg atcncgtnac acagattgat tgagaagnnn tccnactggc cgaattcnag aanctcatcg cttggggaa AA408072 ggtttttcga gacagggttt ctctgtgtag ccctggctgt ccttgaactc actttgtaga SEQ ID NO: 191 ccaggctggc ctcgaactca gaaatccgcc tgcctctgcc tcccaagtgc tgggattaaa ggcattcgcc accaccaacc ggcgataaac aaattttata cgaaagaaaa gaagcaagta agattatgag aaacataagc tattttaaga gagtttagag aagatccttc aaatatttta aaagagatct gaataaatca gaaagcatta ttcctggata aataatgggg agagaaataa tagattaana tacaacctat caaaatttaa tc AA407615 cgagacaggg tttctctgtg cagtcctgga actcactctg tagaccaggc tggccttgaa SEQ ID NO: 192 ctcagaaatc cacctgcctc tgcctcccaa gtgctgggat tgcaggcatg cgccaccact gcctggctgc ctggtttttt aattactggc tttagcctaa atggcaaatt ctataactag gttataagaa tagttttaaa agaaagagcc tcaggagagt gggaacagga acatggagaa gtaagaggac acctgggctt tagtcaagat cctgtctaaa acaaaacaga ggggncggna gagctngngc aatggctcag ttggttagag c AA995272 ccgccacggg ggggtcgcga tcggtccgag gttatctaga gtcaccaaag ccgccggcgt SEQ ID NO: 193 cgtcccc C78593 ctctctccag gtattcctac ctaaccttaa cttttcctcg ggttcaagac ccttggaaag SEQ ID NO: 194 gcctgtatac ttattttgtg aaccatattt tctctttgtt cctactcttt cttcccgctt tacttctgat agcttgtcct gaatttcctc tagaattttc agccctatct taaccactat ataacatgtg aaaaggaaca aaagggcttc taacactaga aaaaattcaa ggccaaacat aacttgtaaa gccattttcc actttacttc tgatagactg tcttgaattt ccttagaaag ttcaagatca gacttacctc gttccccagc tgaaaagttc tgaattcata cagttgaatc ctcttaacag tctggcttta cgggaacctt atcaccgtcg ttccccagct ggatgagttc tgaatcggca gttgaatcct tctcaacagt ctgtgttacg ggaaccttat aacctggatt cgcagttcng ggttctggga aggaaagtaa tcccctcctg gcggccagtn ccgggagntt ttttcctcgg tcccgggatt tttcctcggt ccccgggnaa ttcgggcacc caa AA999910 tgggtccgtt cctaaaacaa aaaaaaaaaa acagcggtcc tattccaata ttcctagc SEQ ID NO: 195 AA991491 tgggcagacg ttcgaatggg tc SEQ ID NO: 196 X99994 gacatcgagc tcactcagtc tccagcttct ttggctgtgt ctctagggca gagggccatc SEQ ID NO: 197 atctcctgca aggccagcca aagtgtcagt tttgctggta ctagtttaat gcactggtac caccagaaac caggacagca acccaaactc ctcatctatc gtgcatccaa cctagaagct ggggttccta ccaggtttag tggcagtggg tctaagacag acttcaccct caatatccat cctgtggagg aggaggatgc tgcaacctat tactgtcagc aaagtaggga atatccgtac acgttcggag gggggacaaa gttg X99993 caggtgcagc tgcagcagtc tggagctgag ctggtgaagc ctggggcttc agtgaagata SEQ ID NO: 198 tcctgcaagg cttctggtta ctcatttact ggctacttta tgaactgggt gaagcagagc catggaaaga gccttgagtg gattggacgt attcatcctt acgatggtga tactttctac aaccagaact tcaaggacaa ggccacattg actgtagaca aatcctctaa cacagcccac atggagctcc tgagcctgac atctgaggac tttgcagtct attattgtac aagatacgac ggtagtcggg ctatggacta ctggggccaa gggaccacgg tcaccgtctc c X99992 caggtgcagc tggtggagtc tgggggaggc ttggtacagc cagggcggtc cctgagactc SEQ ID NO: 199 tcctgcacaa cttctggatt cacttttggt gattatgcta tgatctgggc ccgccaggct ccagggaagg ggctggagtg ggtctcatcc attagtagta gtagtagtta catatactac gcagactcag tgaagggccg attcaccatc tccagagaca acgccaagaa ctcactgtat ctgcaaatga acagcctgag agccgaggac acggctgtgt attactgtgc gagagaacga tacgattttt ggagtggaat ggacgtctgg ggcaaaggga ccacggtcac cgtgtcgagt X99991 cagtctgccc tgactcagcc tgcctcagtg tccgggtctc ctggacagtc cgtctccatc SEQ ID NO: 200 tcctgcactg gaaccatcaa tgatgttggt ggatataggt ttgtctcctg gtaccaacga cgccccggca aagcccccaa actcatcatt tctgatgtca ttaggcggcc atcaggggtc cctgatcgct tctctagttc caagtctgac aacacggcct acctgaccat ctctgggctc caggctgagg acgaggctga ttattactgc agctcatata caagcagcag cactctctat gtcttcggaa ctgggaccaa ggtcaccgtc cta X99990 cagtctgccc tgactcagcc tgcctccgtg tctgggtctc ctggacagtc gatcaccatc SEQ ID NO: 201 tcctgcactg gaaccagcag tgatgttggg agttataacc ttgtctcctg gtaccaacag cacccaggca aagcccccaa actcatgatt tatgagggca gtaagcggcc ctcaggggtt tctaatcgct tctctggctc caagtctggc aacgcggcct ccctgacaat ctctgggctc caggctgagg acgaggctga ttattactgc cagtcctatg acagcagcct gagtgtggta ttcggcggag ggaccaagct gaccgtccta cagtctgccc tgactcagcc tgcctccgtg tctgggtctc ctggacagtc gatcaccatc tcctgcactg gaaccagcag tgatgttggg agttataacc ttgtctcctg gtaccaacag cacccaggca aagcccccaa actcatgatt tatgagggca gtaagcggcc ctcaggggtt tctaatcgct tctctggctc caagtctggc aacgcggcct ccctgacaat ctctgggctc caggctgagg acgaggctga ttattactgc cagtcctatg acagcagcct gagtgtggta ttcggcggag ggaccaagct gaccgtccta AA958985 acacagagca gcagatgtgg ggaagtagaa ggtgaaggga tggcaggagg ctcccacagg SEQ ID NO: 202 acactcgtta tgccccgagg accagttcca tcccttgtgc caattgctct tgcaggtaaa agagctctgg cagtcctccc accactgctg acagtcctct ttgcacaggg gaacatcaag gatccgctct ttgcgccagc tctggtccac ctgctggatc cagggtccca agttcgggga acactcatag aggcaggtgt cttggataaa gtgccgtttg cattccgatg tcatagttcc gcagtggttc cagttgaacc ggtacaggta ggaaatgtcc tcctcgtgc AA873222 ttttttttta aattcatgtt tttaattggc ttaatacaaa ggtcccccag gaggccctgg SEQ ID NO: 203 gaggaggggg acagcctggg agaggcagag attcatggcc agcagcccac ccccacctgc cacccactcc ccaacaaggg tcccagactc tttcaataat cctaaaaaaa ccgacgagag cgcaggcaga tgaagagccc cttcatcctc aaacggcgac taccatcgaa agttgatagg gcagacgttc gaatgggtcg tcgccgccac gggggg AA930051 gatccttcga ctccttggcg cttggtgctt caaccggact gacttcctgg gcctggagtt SEQ ID NO: 204 ggcgattaga ggtctgacat ggctcacctg atgactgtgc agttgttgct cctggtgatg tggatggccg aatgtgctca gtccagagct actcgggcca ggactgaact tctcaatgtc tgcatggatg ccaaacacca caaagaaaaa ccgggccctg aggacaattt acacgaccag tgcagcccct ggaagacgaa ttcctgctgt tccacgaaca caagccagga agcacataag gacatttcct acctgtaccg gttcaactgg aaccactgcg gaactatgac atcggaatgc aaacggcact ttatccaaga cacctgcctc tatgagtgtt ccccgaactt gggaccttgg atccagcagg tggaccagag ctggcgcaaa gagcggatcc ttgattgttc ccctgtgcaa agaggactgt catcagtggt gggaggactt gcagagctct tttccctgca agagcaattt ggacaaggga tggaacttgg tctcggggca taacgagtgt cctgtggggc ctccttgcaa tccttcacgt tttatttccc agattggttg gtcttgttgt gaggaatctg gggttcactc ttacagct AA895334 ccagctccaa taacgtatat gagagttgca gcagataagg ggcaagtagt agagtatgga SEQ ID NO: 205 gagagggaga gcg AA796142 ctgtcaccag tggtgggaag cctgtcgtac ctcctttacc tgcaagagag actggcataa SEQ ID NO: 206 aggctgggac tggtcctcag gcattaacaa gtgcccaaac acagcaccct gtcacacgtt tgagtactac ttcccgacac cagccagcct ttgcgagggt ctctggagtc actcctacaa ggtcagcaac tacagcagag ggagtggccg ctgcatccag atgtggtttg actcaaccca gggcaatccc aatgaggacg tggtgaagtt ttatgcttcc tttatgacat ctgggactgt gccccatgca gcagtacttc ttgtgcccag cctggcccca gtgctgtcat tatggctccc tggctgagag gtcagtcttc ctctctagat ttctcctcta tctacccttg gtctggttca actcttcaaa gaataaggaa gtcttgagcc tggttccacc cctctcctct gtcatccagt tcctgatcca tgttggggga tggggtttct acatcatttc aataaactat gaacatctgg gc AA798223 gacatttcct aactgtaacg ggtcaactgg aagcactgcg gaaatatgac atcggaatgc SEQ ID NO: 207 aaacgggact tttttcaaga cacctgcctc tatgagtgtt ccccgaattt ggaccttgat tcagcaggtg gagcaaaact tgcgcaagaa ggggttcctg aagttcccct gtgcaaaaag gactttcaca attggttgga ggatttccaa agctctttta cccgcaagag gaatttgcac aagggtttga acatgtcctc ggggaataa AA734325 attcggatcc ttcaaacctc ggccggctgt ctcctggaat gaagaaagca aaggaagcct SEQ ID NO: 208 agagtggaga caaagaagcc cgaggactct gagagctgcc atcttttcct tgtttgccgc ctgacacttc tcagcaggat ccacataccc taaggagtgg aagactcctt ggcgcttggt gcttcaaccg gactgacttc ctgggcctgg agttggcgat tagaggtctg acatggctca cctgatgact gtgcagttgt tgctcctggt gatgtggatg gccgaatgtg ctcagtccag agctactcgg gccaggactg aacttctcaa tgtctgcatg gatgccaaac accacaaaga aaaaccgggc cctgaggaca atttacacga ccagtgcagc ccctggaaga cgaattcctg ctgttccacg aacacaagcc aggaagcaca taaggacatt tcctacctgt accggttcaa ctggaaccac tgcggaacta tgacatcgga atgcaaacgg cactttatcc aagacacctg cctctatgag tgttccccga acttgggacc ctggatccag caggtggacc agagctggcg caaagagcgg atccttgatg ttcccctgtg caagaggact gtcagcagtg gtgggaggac tgccagagct cttttaccct gcagagcaat tggcacaagg gtggaatggt cccccgggca taacgatttc ccgtggaggc ttctggaatc ccttaacctc taattcccaa tctgcggcct gtgtg AA690871 attcggatcc ttcctggaag tataaacaag aaaggaggct gacggctcta gaagtcccaa SEQ ID NO: 209 cctgttgtga tcttcagtag acaaacactc ctggtgtgtc acaggattca ggccactaaa cctcggccgg ctgtctcctg gaatgaagaa agcaaaggaa gcctagagtg gagacaaaga agcccgaggc actctgagag ctgccatctt ttccttgttt gccgcctgac acttctcagc aggatccaca taccctaagg agtggaagac tccttggcgc ttggtgcttc aaccggactg acttcctggg cctggagttg gcgattagag gtctgacatg gctcacctga tgactgtgca gttgttgctc ctgctgatgt ggatggccga atgtgctcag tccagagcta ctcgggccag gactgaactt ctcaatgtct gcatggatgc caaacaccac aaagaaaaac cgggccctga ggacaattta cacgaccagt gcagcccctg gaagacgaat tcctgctgtt tcacgaacac aagccaggaa gcacataagg acagttccta cctgtaccgg ttcaactggg accactgcgg aactatgaca tcggaatgca aacggcactt tatccagaaa cctgcctcta ttagtgttcc cccacattgg gaccctggat tcaccagtgg gacaaagatg gcgcgaaaaa acgggtcc AA674988 attcggatcc ttcggaacta tgacatcgga atgcaaacgg cactttatcc aagacacctg SEQ ID NO: 210 cctctatgag tgttccccga acttgggacc ctggatccag caggtggacc agagctggcg caaagagcgg atccttgatg ttcccctgtg caaagaggac tgtcagcagt ggtgggagga ctgccagagc tcttttacct gcaagagcaa ttggcacaag ggatggaact ggtcctcggg gcataacgag tgtcctgtgg gagcctcctg ccatcccttc accttctact tccccacatc tgctgctctg tgtgaggaaa tct AA674863 attcggatcc ttcctggaag tataaaccag aaaggaggct gacggctcta gaagtcccca SEQ ID NO: 211 acctgttgtg atcttcagta gacaaacact cctggtgtgt cacaggattc aggccactaa acctcggccg gctgtctcct ggaatgaaga aagcaaagga agcctagagt ggagacaaag aagcccgagg cactctgaga gctggcatct tttccttgtt tgccgcctga caattctcag cagggtccac atatcctaag taagagtggg agactccttt gcgcttggtg cttcaaccgg actgaattcc tgggcctgga attggcgatt agaggtccga catggctcaa ctgatgacct tgcaattgtt ggccccggtg atgtggatgg gcgaaagtgc ttcagttcaa gaagctactt cgggccaagg actgaaactt tctcaaatgt AA674821 gaagactcct tggcgcttgg tgcttcaacc ggactgactt cctgggcctg gagttggcga SEQ ID NO: 212 ttagaggtct gacatggctc acctgatgac tgtgcagttg ttgctcctgg tgatgtggat ggccgaatgt gctcagtcca gagctactcg ggccaggact gaacttctca atgtctgcat ggatgccaaa caccacaaag aaaaaccggg ccctgaggac aatttacacg accagtgcag cccctggaag acgaattcct gctgttccac gaacacaagc caggaagcac ataaggacat ttcctacctg taccggttca actggaacca ctgcggaact atgacatcgg aatgcaaacg gcactttatc caagacacct gcctctatga gtgttccccg aacttgggac cctggatcca gcaggtggac cagagctggc gcaaagagcg gatccttgat gttcccctgt gcaaagagga ctgtcagcag tggtgggagg actgccagag ctcttttacc tgcaagagca attggcacaa gggatggaac tggtcctcgg ggcataacga gtgtcctgtg ggagcctcct gccatccctt caccttccta cttcccaaca tctgctgctc tgtgtggagt aatctggagt cactcctcaa gctcagcaac tacagttcga gg AA674744 cgggccctga ggacaattta cacgaccagt gcagcccctg gaagacgaat tcctgctgtt SEQ ID NO: 213 ccacgaacac aagccaggaa gcacataagg cactttccta cctgtaccgg ttcaactgga accactgcgg aactatgaca tcggaatgca aacggcactt tatccaagac acctgcctct atgagtgttc cccgaacttg ggaccctgga tccagcaggt ggaccagagc tggcgcaaag agcggatcct tgatgttccc ctgtgcaaag aggactgtca gcagtggacg gaggactgcc agagctcttt tacctgcaag agcaattggc acaagggatg gaactggtcc tctgggcata acgagtgtcc tgtgggagcc tcctgccatc ccttcacctt ctacttcccc a AA671558 ctggagctga gcacacactt ggaggttcca cttaccttag ctctgccttc agggtctgac SEQ ID NO: 214 atggctcacc tgatgactgt gcagttgtgg ctgctggtga tgtggatggc cgaatgtgct cagtccagaa ctactcgggc caggactgaa cttctcaatg tctgcatgga tgccaaacac cacaaagaaa aaccgggccc tgaggacaat ttacacgacc agtgcagccc ctggaagacg aattcctgct gttccacgaa cacaagccag gaagcacata aggacatttc ctacctgtac cggttcaact ggaaccactg cggaactatg acatcggaat gcaaacggca ctttatccaa gacacctggc tctatgagtg ttccccgaac ttgggaccct ggattcagca ggtggaccaa agctggcgca agagagggat cctttatgtt cccctggtgc aaagaggact tgtcagcagt tggtgggagg actgccagaa ctcgtgtacc tgccaggagc aattggcaca agggatggaa ttggttcttc ggggcataac gaagtgctct gtgtggagcc tcctgcagtc ctgtaacgtc taattcccac atttggcggt ctgtgtaatg aatctcgggc actccacagg ctc AF000381 acacctgcct ctacgagtgc tcccccaact tggggccctg gatccagcag gtggatcaga SEQ ID N0: 215 gctggcgcaa agagcgggta ctgaacgtgc ccctgtgcaa agaggactgt gagcaatggt gggaagattg tcgcacctcc tacacctgca agagcaactg gcacaagggc tgcaactgga cttcagggtt taacaagtgc gcagtgggag ctgcctgcca acctttccat ttctacttcc ccacacccat tgcccg AF000380 gtgtccccag aagtggcctt gaaccgaata tctccaatgg acagggctgg ggagcccaca SEQ ID NO: 216 gggctggtgc ggcgggagtc agtggaggcg aagatgcaga gtgccagctg gaaggtcaga atacgctcca ccaccatggc ctggccctgc gttgtgttgt tggtagagcg cgttgtctac cctgtaccga agacagaggc tgtggggaca gcctaggggc cctggatcta ttgcctactt agagagaggc caactcagac acagccgtgt atgctcccag cagcaacgga ggttcagcac cgcctgcagg gacagaaaga catggtctgg aaatggatgc cacttctgct gcttctggtc tgtgtagcca ccatgtgcag tgcccaggac aggactgatc tcctcaatgt ctgtatggat gccaagcacc acaagacaaa gccaggtcct gaggacaagc tgcatgacca atgcagtccc tggaagaaga atgcctgctg cacagccagc accagccagg agctgcacaa ggacacctcc cgcctgtaca actttaactg ggaccactgc ggcaagatgg agcccgcctg cagcgccact tcatccagga cacctgtctc tatgagtgct caccaacctg gggccctgga tccagcaggt gaatcagagc tggcggcaaa gaacgcttcc tggatgtgcc cttatgcaaa gagcactgtc agcgctggtg ggaggattgt cacacctccc acacgtgcaa gagcaactgg cacagaggat gggactggac ctcaggagtt aacaagtgcc cagctggggc tctctgccgc acctttgagt cctacttccc cactccagct gccctttgtc aaggcctctg gagtcactca tacaaggtca gcaactacag ccgagggagc ggccgctgca tccagatgtg gtttacttca gcccagggca accccaacga ggaagtggcg aggttctatg ctgcagccat gcatgtgaat gctggtgaga   tgcttcatgg gactgggggt ctcctgctca gtctggccct gatgctgacc ctctggctcc  tcggctgcgt tcagtcctcc cagactacct gccctcagct tggataacca ggctgggctc agctcagctc ccacaaatga cagcccctta agcatgcttc tattagtcac ctaaccctct AA637071 aggattctat gccgaggcca tgagtggagc tgggcttcat gggacctggc cactcttgtg SEQ ID NO: 217 cagcctgtcc ttagtgctgc tctgggtgat cagctgagtt cctgttttac cttcagttgt ctggagcgcc accctgcttg gctcagcctc ccagctccca gcctcctttg tggtggggct ctgacagcct ctttaataaa ccagacattc c AA616314 attaggatcc ttccttctca gcaggatcca cataccctaa ggagtggaag actccttggc SEQ ID NO: 218 gcttggtgct tcaaccggac tgacttcctg ggcctggagt tggcgattag aggtctgaca tggctcaact gatgactgtg cagttgttgc tcctggtgat gtggatggcc gaatgtgctc agtccagagc tactcgggcc aggactgaac ttctcaatgt ctgcatggat gccaaacacc acaaagaaaa accgggccct gaggacaatt tacacgacca gtgcagcccc tggaagacga attcctgctg ttccacgaac acaagccagg aagcacataa ggacatttcc tacctgtacc ggttcaactg gaaccactgc ggaaatatga aatcggaatg caaacggcac tttatccaag aaaccttgac tcaatgagtg ttacacgaaa cttgggacac tggataagca agtggaacag agatgggcga aaagagcgga tacattgatg taaccctgtg acaagaggac tgttcagcag tggtgggagg actgccaga AA109687 aattcggatc catgatctgg aagtataaac aagaaaggag gctgacggct ctagaagtcc SEQ ID NO: 219 ccaacctgtt gtgatcttca gtagacaaac actcctggtg tgtcacagga ttcaggccac taaacctcgg ccggctgtct cctggaatga agaaagcaaa ggaagcctag agtggagaca aagaagcccg aggcactctg agagctgcca tcttttcctt gtttgccgcc tgacacttct cagcaggatg cacataccct aagcagggag tggagagagg cctgggctgg gccaggtttt ctgggctttt tcctgtgctc cgagtaggtg ggttgtattt tacccagtag gagtggaaga ctccttggcg cttggtgctt caaccggact gacttcctgg gcctggagtt ggcgattaga ggtctgacat ggctcacctg atgactgtgc agttgttgct cctggtgatg tggatggccg aatgtgctca gtccagagct actcgggcca ggactgaact tctcaatgtc tgcatggatg ccaaacacca caaagaaaaa ccgggccctg aggacaattt acacgaccag tgcagcccct ggaagacgaa ttcctgctgt tcaacgacac aagcaggaag cactaaggac ttttctactg t AA608235 ttcggatcct tctctggaag tataaacaag aaaggaggct gacggctcta gaagtcccca SEQ ID NO: 220 acctgttgtg atcttcagta gacaaacact cctggtgtgt cacaggattc aggccactaa acctcggccg gctgtctcct ggaatgaaga aagcaaagga agcctagagt ggagacaaag aagcccgagg cactctgaga gctgccatct tttccttgtt tgccgcctga cacttctcag caggatccac ataccctaag gagtggaaga ctccttggcg cttggtgctt caaccggact gacttcctgg gcctggagtt ggcgattaga ggtctgacat ggctcacctg atgactgtgc agttgttgct cctggtgatg tggatggccg aatgtgctca gtccagagct actcggggcc aggactgaac ttctcaatgt ctgcatggat gccaaacacc acaaagaaaa accgggccct gaggacaatt tacacgacca gtgcagcccc tggaagacga attcctgctg ttccacgaac acaagccagg aagcacataa ggacatttcc tacctgtacc ggttcaactg gaaccactgc ggaactatga catcggaatg caaacggcac tttatccaag acacctgcct ctatgagtgt tccccgaact tgggactgga ttcagcaggt ggacc AA589050 tggaagactc cttggcgctt ggtgcttcaa ccggactgac ttcctgggcc tggagttggc SEQ ID NO: 221 atttagaggt ctgacatggc tcacctgatg actgtgcagt tgttgctcct ggtgatgtgg atggccgaat gtgctcagtc cagagctact cgggccagga ctgaacttct caatgtctgc atggatgcca agcaccacaa agaaaaaccg ggccctgagg acaatttaca cgaccagtgc agcccctgga agacgaattc ctgctgttcc acgaacacaa gccaggaagc acataaggac atttcctacc tgtaccggtt caactggaac cactgcggaa ctatgacatc ggaatggcaa cggcactttt atcaaagaca cctgcctcta tgagtgttcc ccgaactttg ggaacctgga ttccagaagt tggacagagc ctgcgcaaaa gagcggattc ttgatggttc cctgtgcaaa gaggactgtc agcagtggtg ggagactgcc aagctcttta cctgcaagag cattggcaca aggatggaat ggtcctctgg caaacga AA544782 atggctccct gatgactgtg cagttgttgc tcctgctgat gtggatggcc gaatgtgctc SEQ ID NO: 222 agtccagagc tactcgggcc aggactgaac ttctcagtgt ctgcatggat gccagacacc acaaagagaa accgggccct gaggacaatt tacacgacca gtgcagcccc tggaagacga attcctgctg ttccacgaac acaagccagt aagcacataa ggacatttcc tacctgtacc ggttcaactg gaaccactgc ggaactatga catcggaatg caaacggcac tttatccaag acagctgcct ctatgagtgt tccccgaact tgggagcctg tatgcagcag gtggacgaga gctgtcgcaa agagcggatc cttgatgtgc ccctgtgcaa agaggactgt cagcagtggt gcgagtgctg cggagctctt gtacctgcag agaggaattt gcacagggga tggaactggt tccctggggc ataacaagtg tcctgtggta gcctgccggc aggccgttag cgttgtagtt tcgcggatcg gctggtcggg tgaagaagtt gtggggcatg ccacatgtca gtagtttgtt AA522095 aattcgcatc cttcataaac aagacaggag gctgacggct ctagaagtcc ccaacctgtt SEQ ID NO: 223 gtgatcttca gtagacaaac actcctggtg tgtcacagga ttcaggccac taaacctcgg ccggctgtct cctggaatga agaaagcaaa ggaagcctag agtggagaca aagaagcccg aggcactctg agagctgcca tcttttcctt gtttgccgcc tgacacttct cagcaggatc cacataccct aaggagtgga agactccttg gcgcttggtg cttcaaccgg actgacttcc tgggcctgga gttggcgatt agaggtctga catggctcac ctgatgactg tgcagttgtt gctcctggtg atgttgatgg ccgaatgtgc tcagtccaga gctactcggg ccaggactga acttctcaat gtctgcatgg atgccaaaca ccacaaagaa aaaccgggcc ctgaggacaa tttacacgac cagtgcagcc cctggaagac gaatttctgc tgttccacga acacaagcca ggaagcacat aaggacattt cctaactgta acggttcaat gg AA386821 tcccatttcc tacctgtacc ggttcaactg gaaccactgc ggaactatga catcggaatg SEQ ID NO: 224 caaacggcac tttatccaag acacctgcct ctatgagtgt tccccgaact tgggaccctg gatccagcag gtggaccaga gctggcgcaa agagcggatc cttgatgttc ccctgtgcaa agaggactgt cagcagtggt gggaggactg ccagagctct tttacctgca agagcaattg gcacaaggga tggaactggt cctcggggca taacgagtgt cctgtgggag cctcctgcca tcccttcacc ttctacttcc ccacatctgc tgctctgtgt gaggaaatct ggagtcactc ctacgagctc ag AA386818 ctatcccatt tcctacctgt accggttcaa ctggaaccac tgcggaacta tgacatcgga SEQ ID NO: 225 atgcaaacgg cactttatcc aagacacctg cctctatgag tgttccccga acttgggacc ctggatccag caggtggacc agagctggcg caaagagcgg atccttgatg ttcccctgtg caaagaggac tgtcagcagt ggtgggagga ctgccagagc tcttttacct gcaagagcaa ttggcacaag ggatggaact ggtcctcggg gcataacgag tgtcctgtgg gagcctcctg ccatcccttc accttctact tccccacatc tgctgctctg tgtgaggaaa tctggagtca ctcctacaag ctcag AA386495 tatccctgag agctgccatc ttttccttgt ttgccgcctg acacttctca gcaggatcca SEQ ID NO: 226 cataccctaa gggagtggag agaggcctgg gctgggccag gttttctggg ctttttcctg tgctccgagt cagtgggttg tattttaccc agtaggagtg gaagactcct tggcgcttgg tgcttcaacc ggaactgact tcctgggcct ggagttggcg attagaggtc ctacatggct cacctgatga ctgtgcaagt tgtgcccccg gtgatgttga atggcggatg tgctcagtcc agaagtaatt tgggccaaga ctggacttct ccatggctgc attgatggca aacaccccaa aggaaaacgg ggccttgggg caattatcac ggccctgtaa cccttggaaa ccaattcccg ggttccgaaa cacagccgga AA289278 aattcggatc catgggctga tctggaagta taaacaagaa aggaggctga cggctctaga SEQ ID NO: 227 agtccccaac ctgttgtgat cttcagtaga caaacactcc tggtgtgtca caggattcag gccactaaac ctcggccggc tgtctcctgg aatgaagaaa gcaaaggaag cctagagtgg agacaaagaa gcccgaggac tctgagagct gccatctttt ccttgtttgc cgcctgacac ttctcagcag gatccacata ccctaaggga gtggagagag gcctgggctg ggcaggtttt ctgggctttt tcctgtgctc cgagtaggtg ggttgtattt tacccagtag gagtggaaga ctccttggcg cttggtgctt caaccggact gacttcctgg gcctggagtt ggcgattaga ggtctgacat ggctcacctg atgactgtgc agttgttgct cctggtgatg tggatggccg aatgtgctca gtccagagct actcgggcca ggactgaact tctcaatgtc tgcatggatg ccaa AA286342 ttggcatcca tgcagacatt gagaagttca gtcctggccc gagtagctct ggactgagca SEQ ID NO: 228 cattcggcca tccacatcac caggagcaac aactgcacag tcatcaggtg agccatgtca gacctctaat cgccaactcc aggcccagga agtcagtccg gttgaagcac caagcgccaa ggagtcttcc actcctactg ggtaaaatac aacccaccta ctcggagcac aggaaaaagc ccagaaaacc tggcccagcc caggcctctc tccactccct tagggtatgt ggatcctgct gagaagtgtc aggcggcaaa caaggaaaag atggcagctc tcagagtgcc AA276302 ttcggatcca tggtgctccg agtaggtggg ttgtatttta cccagtagga gtggaagact SEQ ID NO: 229 ccttggcgct tggtgcttca accggactga cttcctgggc ctggagttgg cgattagagg tctgacatgg ctcacctgat gactgtgcag ttgttgctcc tggtgatgtg gatggccgaa tgtgctcagt ccagagctac tcgggccagg actgaacttc tcaatgtctg catggatgcc aaacaccaca aagaaaaacc gggccctgag gacaatttac acgaccagtg cagcccctgg aagacgaatt cctgctgttc cacgaacaca agccaggaag cacataagga catttcctac ctgtaccggt tcaactggaa ccactgcgga actatgacat cggaatgcaa acggcacttt atccaagaca cctgcctcta tgagtgttcc ccgaacttgg gaccctggat ccagcaagtg gaccagagct ggcgcaagag cggatccttg aatgtccctg tgcaagagga ctgtcagcag tggtgggaga ctgcagagct ctt AA276123 aattcgggat ccatgggctg atctggaagt ataaacaaga aaggaggctg acggctctag SEQ ID NO: 230 aagtccccaa cctgttgtga tcttcagtag acaaacactc ctggtgtgtc acaggattca gctctgtttc ctaggccact aaacctcggc cggctgtctc ctggaatgaa gaaagcaaag gaagcctaga gtggagacaa agaagcccga ggcactctga gagctgccat cttttccttg tttgccgcct gacacttctc agcaggatcc acatacccta aggagtggaa gactccttgg cgcttggtgc ttcaaccgga ctgacttcct gggcctggag ttggcgatta gaggtctgac atggctcacc tgatgactgt gcagttgttg ctcctggtga tgtggatggc cgaatgtgct cagtccagag ctactcgggc caggactgaa cttctcaatg tctgcatgga tgccaaacac cacaaagaaa aaccgggccc tgaggacaat ttacacgacc agtgcagccc ctggaagacg aattcctgct gttccacgaa cacaagccag gaagcacata aggacat AA277280 attcggatcc acgtataaac aagaaaggag gctgacggct ctagaagtcc ccaacctgtt SEQ ID NO: 231 gtgatcttca gtagacaaac actcctggtg tgtcacagga ttcagctctg tttcctaggc cactaaacct cggccggctg tctcctggaa tgaagaaagc aaaggaagcc tagagtggag acaaagaagc ccgaggcact ctgagagctg ccatcttttc cttgtttgcc gcctgacact tctcagcagg atccacatac cctaagggag tggagagagg cctgggctgg gcaggttttc tgggcttttt cctgtgctcc gagtaggtgg gttgtatttt acccagtagg agtggaagac tccttggcgc ttggtgcttc aaccggactg acttcctggg cctggagttg gcgattagag gtctgacatg gctcacctga tgactgtgca gttgttgctc ctggtgatgt ggatggccga attggctcat tccaaagcta ctcgggccgg aactgaactc ctcaaggtct gcatggatgc aaacgccaca aagaaaa AA273543 gttcggatcc atgggctgat ctggaagtat aaacaagaaa ggaggctgac ggctctagaa SEQ ID NO: 232 gtccccaacc tgttgtgatc ttcagtagac aaacactcct ggtgtgtcac aggattcagc tctgtttcct aggccactaa acctcggccg gctgtctcct ggaatgaaga aagcaaagga agcctagagt ggagacaaag aagcccgagg cactctgaga gctgccatct tttccttgtt tgccgcctga cacttctcag caggatccac ataccctaag ggagtggaga gaggcctggg ctgggccagg ttttctgggc ttttcctgtg ctccgagtag gtgggttgta ttttacccag taggagtgga agactccttg gcgcttggtg cttcaaccgg actgacttcc tgggcctgga gttggcgatt agaggtctga catggctcac ctgatgactg tgcagttgtt gctcctggtg atgtggatgg cgaatgtgct cagtccagag ctactcgggc caagactgaa cttctcaatg tctgcatgga tgccaacacc acaagaaaaa cggggcttga gacaatttca cgacagtgca gccctggaaa aga 1189949 gaattcgcgg ccgctccggg aaggggggaa gggcacaact ccctcgggaa gctcgccgct SEQ ID NO: 233 gcctcctgga gcagaaggca gacaaagcca tgccctggaa gctgacagcc cttctgctct ttctggccgg ggtggtctcc gtgtgccgcg cccgggccag gacggacctg ctcaacgtct gcatggatgc caagcaccac aaggtagagc caggccctga ggacgagctg cacgaccagt gcgtcccctg gaagaagaac gcctgctgct ccgccagagt cagccacgag ctgcaccggg acaagtcctc cctgtataac ttttcctggg agcactgcgg caggatggag ccggcctgca agcgccactt cattcagaac aactgtctgt acgagtgctc gcccaacctg gggccctggt tccaggaggt gaaccagaag tggcgcaaag agcggttcct gaacgtgccc ctctgcaaag aggactgtct ggactggtgg gaagactgcc gcacctccta cacctgcaag agcagctggc acaagggctg gaactggagc tcaggatcta accagtgtcc cacggggacc acctgcgaca catttgagtc cttcttcccc acacccgcag cgctgtgtga gggcatctgg aatcacgatt ataagttcac caactacagc cggggcagcg gccgctgcat ccagatgtgg tttgacgcgg ccgagggcaa ccccaacgag gaggtagcga ggttctacgc cttggccttg agtgcgggga ccatgtccct tgggaccggg cctctcctgc tcagcgcagc cctgatgctg ccacttgggc tccttgactg agtccggcgt ctccagacgg tccttctgcc tgtccccagc tttgatgacc aggctggtct caactcagct cccaccaatg agggagccct aagcccgcct catctgttac ccatccctct gtcatcaagt tcctgccgta gggtgggcct tggggtctct ctgacagcca gttctaacag gcagattaac agcactgtgt ctgatgggct gttttggttg tgagctggtg tgtggcagag gacagagccc atagcttttg gattccttca gcttagagaa atgagacctg ggtttgaatt ccagctctgc cactcactat gtcaagtgaa gcagttgcgc gacggctcta aaccataggc tcctcctcaa taaaatgaag AA208306 aatctggagt cactcctaca agctcagcaa ctacagtcga gggagcggcc gctgcattca SEQ ID NO: 234 gatgtggttc gacccagccc agggcaaccc caacgaggaa gtggcgaggt tctatgccga ggccatgagt ggagctgggt ttcatgggac ctggccactc ttgtgcagcc tgtccttagt gctgctctgg gtgatcagct gagctcctgt tttaccttca gttgtctgga gcgccaccct gcttggctca gcctcccagc tcccagcctc ctttgtggtg gggctctgac agcctcttta ataaaccaga cattcca AA208089 cacgaacaca agccaggaag cacataagga catttcctac ctgtaccggt tcaactggaa SEQ ID NO: 235 ccactgcgga actatgacat cggaatgcaa acggcacttt atccaagaca catgcctcta tgagtattcc ccgaacttgg gaccctggat ccagcaggtg gaccagagct ggcgcaaaga gcggatcctt gatgttcccc tgtgcaaaga ggactgtcag cagtggtggg aggactgcca gagctctttt acctgcaaga gcaattggca caagggatgg aactggtcct cggggcataa cgagtgtcct gtgtgagcct cctgccatcg cttcaccttc tacttcccca catctgctgc tctgtgtgaa gaaatctgga gtcactccta caagcttaac aactacagtc gagggaagcg gccgctgcag tcagatgtgg ttcgacccag ccatggcaaa cccagcgagg aagttgcgag gtctatgccg aggcaatagt gagctggtgt ctgggactgg gcactttgt AA242285 aagactgtag agactaccca gagtctgacc tagggacagg ccaactcgga tacccctatg SEQ ID NO: 236 tgcgctccca gaagctaagg acattgagac agaaagacat ggcctggaaa cagacaccac tcttgctttt ggtctacatg gtcacaacag gcagtggcgg gacagaacag acctactcaa cgtttgcatg gatgccaaac accataagac aaagccgggc cccgaggaca agctgcatga ccagtgtagt ccatggaaga aaaatgcctg ttgctcagtc aacaccagcc aggagctaca caaggctgac tcccgtctgt acttcaactg ggatcactgt ggcaagatgg agcctgcctg taagagtcac ttcatccaag actcctgcct gtatgattgt ttcccaaacc ttggcccttg attcagtcaa gtggatcaag attgggctta aaaaggtttt cctgatgtgc ccctaatgca agaagacctg tcaccagtgt tggaaagctt gtggtacctc ctttactggc agaagagact ggcataaagc tcggact AA139715 attcggatcc atgggctgat ctggaagtat aaacaagaaa ggaggctgac ggctctagaa SEQ ID NO: 237 gtcccaacct gttgtgatct tcagtagaca aacactcctg gtgtgtcaca ggattcagct ctgtttccta ggccactaaa cctcggccgg ctgtctcctg gaatgaagaa agcaaaggaa gcctagatgt gagacaaaga agcccgaggc actctgagag ctgccatctt ttccttgttt gccgcctgac acttctcagc aggatccaca taccctaagg agtggaagac tccttggcgc ttggtgcttc aaccggactg acttcctggg cctggagttg gcgattagac tctgccttca gggtctgaca tggctcacct gatgactgtg cagttgttgc tcctggtgat gtggatggcc gaatgtgctc agtccagagc tactcgggcc aggactgaac ttctcaatgt ctgcatggat gccaaacacc acagagaaag accgggccct gaggacaatt ttacacgaca gtgcagcccc tggaagacga attcctgttg ttcacgaaca caagcaggat gacataggac atttctactg taccgttcac tggaac AA139709 aattcggatc catgggctga tctggaagta taaacaagaa aggaggctga cggctctaga SEQ ID NO: 238 agtccccaac ctgttgtgat cttcagtaga caaacactcc tggtgtgtca caggattcag ctctgtttcc taggccacta aacctcggcc ggctgtctcc tggaatgaag aaagcaaagg aagcctagag tggagacaaa gaagcccgag gcactctgag agctgccatc ttttccttgt ttgccgcctg acacttctca gcaggatcca cataccctaa ggagtggaag actccttggc gcttggtgct tcaaccggac tgacttcctg ggcctggagt tggcgattag actctgcctt cagggtctga catggctcac ctgatgactg tgcagttgtt gctcctggtg atgtggatgg ccgaatgtgc tcagtccaga gctactcggg ccaggactga acttctcaat gtctgcatgg atgccaaaca ccacaaagaa aaaccgggcg ctgaggacaa tttacacgac cagtgcagca cctggaagac gaattcctgg ctgttcacga gcacaagcta ggaagcacat aaggacattt tctanctgta ccggttcaac tggacccact gcggactatg acatcgga AA139675 attcggatcc atgcagctta gaagggcctc cagctttagg ctttatagat acctggccca SEQ ID NO: 239 cccttcccca gtcagcaggc tgatctggaa gtataaacaa gaaaggaggc tgacggctct agaagtcccc aacctgttgt gatcttcagt agacaaacac tcctggtgtg tcacaggatt caggccacta aacctcggcc ggctgtctcc tggaatgaag aaagcaaagg aagcctagag tggagacaaa gaagcccgag gcactctgag agctgccatc ttttccttgt ttgccgcctg acacttctca gcaggatcca cagacccgaa gtaggagtgg aagactcctt ggcgcttggt gcttcaaccg gactgacttc ctgggcctgg agttggcgat tagaggtctg acatggctca cctgatgact gtgcagttgt tgctcctggt gatgtggatg ggcgaatgtg ctcagtccag agctactcgg gccaggactg aacttctcaa tgtctgcatg gatgtcaaac accacaaaga aacaccgggc ctgaggacaa tttacacgac cagtgcagcc cctggaagac gaatcctgct gttccagaaa caagcaggag cacataggcc attcct AA139593 attcggatcc atgggctgat ctggaagtat aaacaagaaa ggaggctgac ggctctagaa SEQ ID NO: 240 gtccccaacc tgttgtgatc ttcagtagac aaacactcct ggtgtgtcac aggattcagg ccactaaacc tcggccggct gtctcctgga atgaagaaag caaaggaagc ctagagtgga gacaaagaag cccgaggcac tctgagagct gccatctttt ccttgtttgc cgcctgacac ttctcagcag gatccacata ccctaaggag tggaagactc cttggcgctt ggtgcttcaa ccggactgac ttcctgggcc tggagttggc gattagaggt ctgacatggc tcacctgatg actgtgcagt tgttgctcct ggtgatgtgg atggccgaat gtgctcagtc cagagctact cgggccagga ctgaacttct caatgtctgc atggatgcca aacaccacaa agaaaaaccg ggccctgagg acaatttaca cgaccagtgc atgccctgga agacgaattc ctgctgttcc acgaacacaa gccaggaagc acatagagac atttcctgct gtaccggttc aactggacca ctgcggaact atgacatcga atgcagacgc actttgccag gacactggct ctatgagtgt AA124010 aattcggatc catgggctct agaagtcccc aacctgttgt gatcttcagt agacaaacac SEQ ID NO: 241 tccgtggtgt gtcacaggat tcaggccact aaacctcggc cggctgtctc ctggaatgaa gaaagcaaag gaagcctaga gtggagacaa agaagcccga ggcactctga gagctgccat cttttccttg tttgccgcct gacacttctc agcaggatcc acatacccta aggagtggaa gactccttgg cgcttggtgc ttcaaccgga ctgacttcct gggcctggag ttggcgatta gaggtctgac atggctcacc tgatgactgt gcagttgttg ctcctggtga tgtggatggc cgaatgtgct aagtccagag ctactcgggc caggactgaa ctcctaaatg tctgcatgga tgccaaacac cacaaggaaa aacgggcccc tgaggacaat tacacgacca gtgcaagccc tggaagacga aattctgctg ttcaagacca caagccagta gcacataggg acattccaac ctgtaccgtt caacttgaac actgcggaat atgactcg AA108790 ccactaacca cataaggaca tttcctacct gtaccggttg acctgcaacg actgccgaac SEQ ID NO: 242 tatgacatcg caatgcacac gccactttat cgaccacacc tgcctctatg agtgttaccc gaacttcgca ccctccatcc accaggtgca cgacagctggcccacagagcg catccttca tgttcccctg tccacagacg actgtcagca gtcgtcccag cactcccaca gctctcttac ctgcaacacc aattcccaca acggatggaa ctcgtcctcg cggcatcacg agtgtcctgt agcaccctcc tgccatccct tcaccttcta cttccgcaca tctcgtgctc tgtgtgatga actctggagt cactcctaga cactcagcaa ctacagtcga cgg AA108350 aattcggatc catgcatgga tccggatcca tggcccctgg aagacgaatt cctgctgttc SEQ ID NO: 243 cacgaacaca agccaggaag cacataagga catttcctac ctgtaccggt tcaactggaa ccactgcgga actatgacat cggaatgcaa acggcacttt atccaagaca cctgcctcta tgagtgttcc ccgaacttgg gaccctggat ccagcaggtg gaccagagct ggcgcaaaga gcggatcctt gatgttcccc tgtgcaaaga ggactgtcag cagtggtggg aggactgcca gagctctttt acctgcaaga gcaattggca caagggatgg aactggtcct cggggcataa cgagtgtcct gtgggagcct cctgccatcc cttcaccttc tacttcccac atctgctgct ctgtgtgagg aatctggagt cactctacaa gctcagcact acagtcgagg agccgcc AA028831 ctgagtctga ggccagctgg tcgacaaggg tctgacatgg ctcacctgat gactgtgcag SEQ ID NO: 244 ttgttgctcc tggtgatgtg gatggccgaa tgtgctcagt ccagagctac tcgggccagg actgaacttc tcaatgtctg catggatgcc aaacaccaca aagaaaaacc gggccctgag gacaatttac acgaccagtg cagcccctgg aagacgaatt cctgctgttc cacgaacaca agccaggaag cacataagga catttcctac ctgtaccggt tcaactggaa ccactgcgga actatgacat cggaatgcaa acggcacttt atccaagaca cctgcctcta tgagtgttcc ccgaacttgg gaccctggat ccagcaggtg gaccagagct ggcgcaaaga gcggatcctt gatgttcccc tgtgcaaaga ggactgtcag cagtggtggg aggactgcca gagctctttt acctgcaaga gcaattggca caagggatgg aactggtcct cggggcataa cgagtgtcct gtgggagcct cctgccatcc gttcacttct acttcgcaca tctgctgtct gtgtgaggaa tctggagtca ctctacaagt ctagaataca gtcgaggacc ggc AA061275 aaccactgcg gaactatgac atcggaatgc aaacggcact ttatccaaga cacctgcctc SEQ ID NO: 245 tatgagtgtt ccccgaactt gggaccctgg atccagcagg tggaccagag ctggcgcaaa gagcggatcc ttgatgttcc cctgtgcaaa gaggactgtc agcagtggtg ggaggactgc cagagctctt ttacctgcaa gagcaattgg cacaagggat ggaactggtc ctcgggggca taacgagtgt cctgtgggag cctcctggca tcccttcagc ttctacttcc ccacatctgg ctgctcctgt gttaggaaaa tcttggattc actcctacca agcttcagca a W82933 aattcggcac taggggaggc tgacggctct agaagtcccc aacctgttgt gatcttcagt SEQ ID NO: 246 agacaaacac tcctggtgtg tcacaggatt cagctctgtt tcctaggcca ctaaacctcg gccggctgtc tcctggaatg aagaaagcaa aggaagccta gagtggagac aaagaagccc gaggcactct gagagctgcc atcttttcct tgtttgccgc ctgacacttc tcagcaggat ccacataccc taaggagtgg aagactcctt ggcgcttagt gctgctctgg gtgatcagct gagctcctgt tttaccttca gttgtctgga gcgccaccct gcttggctca gcctcccagc tcccagcctc ctttgtggtg gggctctgac agcctcttta ataaaccaga cattccaaaa aag AA015571 gtggacgaag actgtagaga ctacccagag tctgacctag ggagaggcca actcggatac SEQ ID NO: 247 ccctatgtgc gctcccagaa gctaaggaca ttgagacaga aagacatggc ctggaaacag acaccactct tgcttttggt ctacatggtc acaacaggca gtggccggga cagaacagac ctactcaacg tttgcatgga tgccaaacac cataagacaa agccgggccc cgaggacaag ctgcatgacc agtgtagtcc atggaagaaa aatgcctgtt gctcagtcaa caccagccag gagctacaca aggctgactc ccgtctgtac ttcaactggg atcactgtgg caagatggag cctgcctgta agagtcactt catccaagac tcctgcctgt atgagtgctc ccccaacctt gggccttgga tccagcaagt ggaccagagt tggcgtaaag agcgtttcct ggatgtgccc ttatgcagag aggactgtca ccagtggtgg gaagcctgtc gtacctcctt tacctgcaag agagactggc ataaaggctg ggaatggtcg tcaggcatgt acaagtgcgc aacacagcac ctgtacacgt gtgagtactc ttccgaacca gcagcttt W71715 gggctgtgga cgaagactgt agagactacc cagagtctga cctagggaga ggccaactcg SEQ ID NO: 248 gataccccta tgtgcgctcc cagaagctaa ggacattgag acagaaagac atggcctgga aacagacacc actcttgctt ttggtctaca tggtcacaac aggcagtggc cgggacagaa cagacctact caacgtttgc atggatgcca aacaccataa gacaaagccg ggccccgagg acaagctgca tgaccagtgt agtccatgga agaaaaatgc ctgttgctca gtcaacacca gccaggagct acacaaggct gactcccgtc tgtacttcaa ctgggatcac tgtggcaaga tggagcctgc ctgtaagagt cacttcatcc aagactcctg cctgtatgag tgctccccca accttgggcc ttggatccag caagtggacc agagttggcg taaagagcgt ttcctggatg tgccttatgc aaagaggact gtcaccagtg gtgggaagcc tgtcgtacgt cctttacctg caagagagac tggcataaag gctgggactg gtctcaggca ttaccagtgc caaacacagg accctgtaaa cgttgagtac tattccgaaa cagcagcc W59165 ttcggcacag ggggctgtgg acgaagactg tagagactac ccagagtctg acctagggag SEQ ID NO: 249 aggccaactc ggatacccct atgtgcgctc ccagaagcta aggacattga gacagaaaga catggcctgg aaacagacac cactcttgct tttggtctac atggtcacaa caggcagtgg ccgggacaga acagacctac tcaacgtttg catggatgcc aaacaccata agacaaagcc gggccccgag gacaagctgc atgaccagtg tagtccatgg aagaaaaatg cctgttgctc agtcaacacc agccaggagc tacacaaggc tgactcccgt ctgtacttca actgggagca ctgtggcaag atggagcctg cctgtaagag tcacttcatc caagactcct gcctgtatga gtgctccccc aaccttgggc cttggatcca gcaagtggac cagagttggc gtaaagagcg tttcctggat gtgcccttat gc X62753 ggaaaggatt ttctcagccc ccatctccag cactgtgtgt tggccgcacc catgagagcc SEQ ID NO: 250 tcagcactct gaaggtgcag ggggcaaagg ccaaaagagc tctggcctga acttgggtgg tccctactgt gtgacttggg gcatggcctc atctgtgctg aaatgattcc acaaagatta aactggctat catttgttga tttccccctt cttacattta atccttgcag gagaaagcta agcctcaaga tagtttgctt ctctttcccc caaggccaag gagaaggtgg agtgagggct ggggtcggga caggttgaac gggaaccctg tgctctaaca gttagggccc gccgaggaac tgaacccaaa ggatcacctg gtattccctg agagtacaga tttctccggc gtggccctca agggacagac atggctcagc ggatgacaac acagctgctg ctccttctag tgtgggtggc tgtagtaggg gaggctcaga caaggattgc atgggccagg actgagcttc tcaatgtctg catgaacgcc aagcaccaca aggaaaagcc aggccccgag gacaagttgc atgagcagtg tcgaccctgg aggaagaatg cctgctgttc taccaacacc agccaggaag cccataagga tgtttcctac ctatatagat tcaactggaa ccactgtgga gagatggcac ctgcctgcaa acggcatttc atccaggaca cctgcctcta cgagtgctcc cccaacttgg ggccctggat ccagcaggtg gatcagagct ggcgcaaaga gcgggtactg aacgtgcccc tgtgcaaaga ggactgtgag caatggtggg aagattgtcg cacctcctac acctgcaaga gcaactggca caagggctgg aactggactt cagggtttaa caagtgcgca gtgggagctg cctgccaacc tttccatttc tacttcccca cacccactgt tctgtgcaat gaaatctgga ctcactccta caaggtcagc aactacagcc gagggagtgg ccgctgcatc cagatgtggt tcgacccagc ccagggcaac cccaatgagg aggtggcgag gttctatgct gcagccatga gtggggctgg gccctgggca gcctggcctt tcctgcttag cctggcccta atgctgctgt ggctgctcag ctgacctcct tttaccttct gatacctgaa aatccctgcc ctgttcagcc ccacagctcc caactatttg gttcctgctc catggtcggg cctctgacag ccactttgaa taaaccagac accgc Z32564 cgcaggaata gatggacatg gcctggcaga tgatgcagct gctgcttctg gctttggtga SEQ ID NO: 251 ctgctgcggg gagtgcccag cccaggagtg cgcgggccag gacggacctg ctcaatgtct gcatgaacgc caagcaccac aagacacagc ccagccccga ggacgagctg tatggccagt gcagtccctg gaagaagaat gcctgctgca cggccagcac cagccaggag ctgcacaagg acacctcccg cctgtacaac tttaactggg atcactgtgg taagatggaa cccacctgca agcgccactt tatccaggac agctgtctct atgagtgctc acccaacctg gggccctgga tccggcaggt caaccagagc tggcgcaaag agcgcattct gaacgtgccc ctgtgcaaag aggactgtga gcgctggtgg gaggactgtc gcacctccta cacctgcaaa agcaactggc acaaaggctg gaattggacc tcagggatta atgagtgtcc ggccggggcc ctctgcagca cctttgagtc ctacttcccc actccagccg ccctttgtga aggcctctgg agccactcct tcaaggtcag caactatagt cgagggagcg gccgctgcat ccagatgtgg tttgactcag cccagggcaa ccccaatgag gaggtggcca agttctatgc tgcggccatg aatgctgggg ccccgtctcg tgggattatt gattcctgat ccaagaaggg tcctctgggg ttcttccaac aacctattct aatagacaaa tccacatgaa aaaaaaaaa T29279 catgagcagt gtcgaccctg gaggaagaat gcctgctgtt ctaccaacac cagccaggaa SEQ ID NO: 252 gcccataagg atgtttccta cctatataga ttcaactgga accactgtgg agagatggca cctgcctgca aacggcattt catccaggac acctgcctct acgagtgctc ccccaacttg gggccctgga tccagcaggt ggatcagagc tggcgcaaag agcgggtact gaacgtgccc ctgtgcaaag aggactgtna gcaaatggtg gggaagattg tcg M25317 gaattccgga caaggattgc atgggccagg actgagcttc tcaatgtctg catgaacgcc SEQ ID NO: 253  aagcaccaca aggaaaagcc aggccccgag gacaagttgc atgagcagtg tcgaccctgg aggaagaatg cctgctgttc taccaacacc agccaggaag cccataagga tgtttcttac ctatatagat tcaactggaa ccactgtgga gagatggcac ctgcctgcaa acggcatttc atccaggaca cctgcctcta cgagtgctcc cccaacttgg ggccctggat ccagcaggtg gatcagagct ggcgcaaaga gcgggtactg aacgtgcccc tgtgcaaaga ggactgtgag caatggtggg aagattgtcg cacctcctac acctgcaaga gcaactggca caagggctgg aactggactt cagggtttaa caagtgcgca gtgggagctg cctgccaacc tttccatttc tacttcccct ctcccactgt tctgtgcaat gaaatctgga ctcactccta caaggtcagc aactacagcc gagggagtgg ccgctgcatc cagatgtggt tcgacccagc ccagggcaac cccaatgagg aggtggcgag gttctatgct gcagccatga gtggggctgg gccctgggca gcctggcctt tcctgcttag cctggcctaa tgctgctgtg gctgctcagc tgacctcctt ttaccttctg atacctggaa atccctgccc tgttcagccc cacagctccc aactatttgg ttcctgctcc atggtcgggc ctctgacagc cattttgaat aaaccagaca ccgc M86438 cctgtgtctt cccgcatcca gtgtagtctc tggagaaaga atgcctgagc tttaccagca SEQ ID NO: 254 ccacccagga agcccataag aatattccca tctatatgga ttcaactgga accactgtgg agagatggta cctgcctgca aacggcactt tatccaggac acctgccttt acgagtgacc   ccccaacttg gggccctgga tccagcaggt atgcatggct tcctggcatc caagagctag cagaggagct gaattttcca ggcgtctctg caggcagcaa ccccagctcc acttctattc agggctgggt tcctgggatt cttgagcctg agcccttctt ttctaccaaa atctcccagg tggatcagag ctggtgcaaa gagtgggtgc tgaatgtgcc cctgtgcaaa gaggactgtg agcaatggtg ggaagattgt cgcacctcct acacctgcaa gagcaatggg cacaagggct ggaactggac ctcaggtgag ggctggggtg ggcaggaaag gagggatttg gaagtgaagg tgtgttgggt gtggaacagg tgtgtgacat tttggggttg tagggctggc agaatcagag acctttgggg cccagtggct aaaggtcttc cctcttccta cagggtctaa caagtgccag gtggcagctg cctgactacc tttccatctc tactttctca cacccactgc tctgtgcagt gaaatctgga ctcactccta cagggtcagc aactacaacc gagggagcag ccgctgcatc cagatgtggt tcgacctggc ccagggcaac cccaatgagg aggtggcaag gttctatgct gcagctctga gtggggctgg gccctgggca gcctggcctc tcctgctcaa cctggcccta atgctgctgt ggctgctcag ctgacctcct tttaccttct gatacttgga catccctgcc ctgtttagcc ccacagctcc caactatttg gttcctcttc tatggtcttg tctctgacag ccactttgaa taaaccacac accacacatg tatcttgaga attattt J03922 gaattcctct agggagaagt ctcacccaga aggacagcaa aagaggaaaa gaagggaaca SEQ ID NO: 255  acaatgctga ggtttgccat caccctcttt gctgtcatca catcatctac ctgccagcag tatggatgtc tggaagggga cacccacaaa gcgaagccaa gtcctgagcc aaacatgcat gaatgcactc tgtattctga atcttcctgt tgctatgcaa acttcacaga gcaattggct cattccccaa taattaaagt aagcaacagc tactggaaca gatgtgggca gctcagtaaa tcctgtgaag atttcacaaa gaaaatcgag tgcttttacc ggtgttctcc gcacgctgct cgctggatcg atcccagata tactgctgct attcagtctg ttccactgtg tcagagcttc tgtgatgact ggtatgaagc ctgcaaagat gattccattt gtgctcataa ctggctgacg gactgggaac gggatgaaag tggagaaaac cactgtaaga gtaaatgcgt accatacagt gagatgtatg caaatgggac cgacatgtgc cagagtatgt ggggggaatc ctttaaggtg agcgaatcct cctgcctctg cttgcaaatg aacaagaagg acatggtggc aatcaagcac ctcctctccg aaagctcaga ggaaagctcc agtatgagca gcagtgagga gcacgcctgc caaaagaaac tcctgaagtt tgaggcactg cagcaagagg aaggggaaga gagaagatga attttggtgg atgaatatca ggaggagagg aatcattgtg gaggttgtgc tcggggcatc acagcagcct gtcttatccc tcacttctga gaacacaata aatcaatggt tggctatatt M64817 gctttagagg cagatcaggg tgtagttttc agctagcgcc gtgccttccc caccatgttc SEQ ID NO: 256 cttgccatga tgataatgta ctagacctct gaaactgtag cttctttgtt acagagtctc cgtgaatctg gaattcacca attcggcgag tctgaaagcc tcagtgatct ctcaggctcc atctgtctcc actccccagt ggaaggcttg cagctgtgtc accgctccag acttcacaca ggtgctggaa gactgaacta agacagaaag acatggcctg gaaacagaca ccactcttgc ttttggtcta catggtcaca acaggcagtg gccgggacag aacagaccta ctcaacgttt gcatggatgc caaacaccat aagacaaagc cgggccccga ggacaagctg catgaccagt gtagtccatg gaagaaaaat gcctgttgct cagtcaacac cagccaggag ctacacaagg ctgactcccg tctgtacttc aactgggatc actgtggcaa gatggagcct gcctgtaaga gtcacttcat ccaagactcc tgcctgtatg agtgctcccc caaccttggg ccttggatcc agcaagtgga ccagagttgg cgtaaagagc gtttcctgga tgtgccctta tgcaaagagg actgtcacca gtggtgggaa gcctgtcgta cctcctttac ctgcaagaga gactggcata aaggctggga ctggtcctca ggcattaaca agtgcccaaa cacagcaccc tgtcacacgt ttgagtacta cttcccgaca ccagccagcc tttgcgaggg tctctggagt cactcctaca aggtcagcaa ctacagcaga gggagtggcc gctgcatcca gatgtggttt gactcaaccc agggcaatcc caatgaggac gtggtgaagt tttatgcttc ctttatgaca tctgggactg tgccccatgc agcagtactt cttgtgccca gcctggcccc agtgctgtca ttatggctcc ctggctgaga ggtcagtctt cctctctaga tttctcctct atctaccctt ggtctggttc aactcttcaa agaataagga agtcttgagc ctgcttccac ccctctcctc tgtcatccag ttcctgatcc atgttggggg ttggggtttc tacaatcatt ttcaataaat ctatgacaca tctgggccta atgaaaaaaa aaa L25338 ggatccaaga gattttatac tgtccttcag cactgtcctt cagttctttt tgtttttttg SEQ ID NO: 257 ttttttgttt tgttttgttt ttggtttttc gagacagggt ttctctgtgt agccctggct gtcctggaac tcactctgta gaccaggctg gcctcgaact cagaaatcca cctgcctctg cctcccaagt gctgggttta aaggcatacg ccaccacagc ccggctcttc ggttctttag gtcattattt tttggggtag ggggacaaac aaattctcac tatgtatcac agattggcct agaccccaca agccttcccc cttcccgtcc tccatgtcct ggggttgcag gcgtgtctca ccaattgcag ctgggcttgt tttgtgtgtt tccttttgag aggtttcggt cgggtcgggt gcttttgctg cagatgccgc tgtcaggatg ggctgtcagg gcagaatggc ttttggagaa caggaaagga aaatactgag gaagcaaaac tttacaaagc agcactcttt cttgtgtacc ctctaaccac accatcctgt gggctgtcac ttggtcctcc tgccaatctg gagaacttgg cagggctggg tcaccacctc cctcagggct aacaggactt ctaggctgac atgatgaccc agctgataca gagtggaatg ccgagaacct cctgtgacag gatgaaggat ctgtgtgtcc ctggcccttg tcaaggtagc aagcagcagg aacctgaact atttaactat gtgtcataaa gtctggaaat aagatgaaag catggggcat cccatcttct ctaggttgga aagctttgct tcttttataa cccccctccc caatgccatg gggccatggg ataaaagagt ctccttgctg acctctattc cagcttcagg gagcctgagg acatgaatgc tgaaggagaa gggactgatc taatctttca ctatagggac agagagtctg agtcagggaa taaatgaagt ccctcccccc tctggtctag gtctccctaa ctttagctcc ctctgcacag acagaaagac atggcctgga aacagacacc actcttgctt ttggtctaca tggtcacaac aggcagtggc cgggacagaa cagacctact caacgtttgc atggatgcca aacaccataa gacaaagccg ggccccgagg acaagctgca tgaccaggtt ctgtgccagt gtggtcctga tgggagggtg atagagggca gggtggggtt agtgagcagc cagacacacc cacaccctga gctcttgttg gcagagatgg cttggtggaa agtagtgagg tgattttctg agggctgtcc ccagaagagg acacagtagt ggcaatgaag cagttgatca ttagaagcct ctaattagag gccacgtgag gtcatgtgat gataatctct atatctctca aataagggcc cgtggaagca cagggactca ctctcacagg ttagacacac ctgatttttt tttttttgag agcattggtg ttttgcctac atatgtgttg gatcc M97701 tctagaattt tcagccctat cttaagcact atataacatg tgaaaaggaa caaaagggct SEQ ID NO: 258 tctaacacta gaaaaaattt aaggccaaac ataacttgta aagccatttt ccactttact tctgatagac tgtcttgaat ttccttagaa agttcaagat cagacttacc tcgttcccca gctgaaaagt tctgaattca tacagttgaa tccttcttaa cagtctgctt tacgggaacc tttatcaccg tcgttcccca gctgatgagt tctgaattcg gcagttgaat ccttctcaac agtctgtgtt acgggaacct tataaccttg attcgcagtt ctggttctgg aatgagggat cttccttgcg ccagtcccga gttttttctc gtcccggatt ttctcgtccc ggaattcggc accaattgtt attcgacgcg ttctcacgac cggccaggaa gaacaccaca gaccagaatc ttctgcgaca aagctttatt cttacatctt caggaaaaga gagcaagaag caagagagag caagaagcaa gagagggaag caagagagag caagaagcaa gagagggaag caagagagag caaagcaaga gagagagaaa aacgaaaccc cttctatttt aaagagaaca accattgcct agggcgcatc actccctgat tggctgcagc ccatggccga gctgacgttc acgggaaaaa cagagtacaa gtagtcgtaa atacccttgg ctcatgcgca gattatttgt ttaccaactt agaacacagg atgtcagcgc catcttgtga cggcgaatgt gggggcggct tcccacaagg ctccacccac tggagctgag cacacacttg gaggttccac ttaccttagc tctgccttca gggtctgaca tggctcacct gatgactgtg cagttgttgc tcctggtgat gtggatggcc gaatgtgctc agtccagagc tactcgggcc aggactgaac M97700 actagttgtg tctagatcct attgcactga tggtcatgaa gttgaaacat gggggaaaat SEQ ID NO: 259 gaactttata cccttcttca tgacttctgt ccttttgcct gcctcctttc tcatctccta atattacagt cttggtttcc tctctaaatt tttagacttt taacccacac ctaaacctgt atcagctttt ataaaaatct tttcaaaact tcacactgaa gcatctgcct ccaaaggttt tgaatgtgaa cgtgggtaaa ctctgttttt gcaaatggcc catctcttat tttttaattg ccctgtgtga gtctcaggac cactaagtct aacaggctgt gaccagtgat tgtctctagg gcatctgagc ctcacagagt ctgggaagac tgacaggagg aggtgaccca aggtctgtga gtgcaggctc cacccactgg agctgagcac acacttggag gttccactta ccttagctct gccttcaggg tctgacatgg ctcacctgat gactgtgcag ttgttgctcc tggtgatgtg gatggccgaa tgtgctcagt ccagagctac tcgggccagg actgaacttc tcaatgtctg catggatgcc aagcaccaca aagaaaaacc gggccctgag gacaatttac acgaccaggt aggacgaagg gtgatgtgtg gctgactaag ggctcgtggg tcaggagaaa gaagtatcta gtcccagttt atggtggagg tggtcagacc tacctgagga gaccttcggt tctctctagt gtgggtgact ttgacagtac atattggctg ccaactgcca gtgtgatatt atcagctcat cttcctggta gctgaatttt gacgttgcat aagtaaggaa gtagattcaa ggaggaactt gggaatggaa caggcaaacc attgtgatgg ttttagattt aaactgattg gggaggacgc ctctgggagt ctcaggggag ggactgtatg ctgcccagtc acttttctgc cagcctttga agacttgaga aggagactct catatctgag aagcctttgg aggcaggcat ctagcgaaca cttggactgt ggtcctcagc ttgagggctg gagggcttga gggctctgtg ttataacagt tgtttgccat agtgctttta gtatcccaaa gctcactaaa catttaataa aatcagtgtg atgcaacaac tatgaagtca accagcagca ggtctgctat tggggaggta caatcagtgc agacaacaaa gtgggagggg ggtctcaaaa aagccaagat gagggctgga gagttggctc agtggttaaa agcacttgtt gagcttgcag aataccaaggt ctgatccac aacatccaag M64782 tggagctgag cacacacttg gaggttccac ttaccttagc tctgccttca gggtctgaca SEQ ID NO: 260 tggctcacct gatgactgtg cagttgttgc tcctggtgat gtggatggcc gaatgtgctc  agtccagagc tactcgggcc aggactgaac ttctcaatgt ctgcatggat gccaagcacc acaaagaaaa accgggccct gaggacaatt tacacgacca gtgcagcccc tggaagacga attcctgctg ttccacgaac acaagccagg aagcacataa ggacatttcc tacctgtacc ggttcaactg gaaccactgc ggaactatga catcggaatg caaacggcac tttatccaag acacctgcct ctatgagtgt tccccgaact tgggaccctg gatccagcag gtggaccaga gctggcgcaa agagcggatc cttgatgttc ccctgtgcaa agaggactgt cagcagtggt gggaggactg ccagagctct tttacctgca agagcaattg gcacaaggga tggaactggt cctctgggca taacgagtgt cctgtgggag cctcctgcca tcccttcacc ttctacttcc ccacatctgc tgctctgtgt gaggaaatct ggagtcactc ctacaagctc agcaactaca gccgagggag cggccgctgc attcagatgt ggtttgaccc agcccagggc aaccccaacg aggaagtggc gaggttctat gccgaggcca tgagtggagc tgggcttcat gggacctggc cactcttgtg cagcctgtcc ttagtgctgc tctgggtgat cagctgagtt cctgttttac cttcagttgt ctggagcgcc accctgcttg gctcagcctc ccagctccca gcctcctttg tggtggggct ctgacagcct ctttaataaa ccagacattc cacatgtgcc ttatgaatta aaaaaaaaaa aaaaaaaaa M350369 acaaggattg catgggccag gactgagctt ctcaatgtct gcatgaacgc caagcaccac SEQ ID NO: 261 aaggaa J05013 ctggaggcct ggctggtgct cacatacaat aattaactgc tgagtggcct tcgcccaatc SEQ ID NO: 262 ccaggctcca ctcctgggct ccattcccac tccctgcctg tctcctaggc cactaaacca cagctgtccc ctggaataag gcaaggggga gtgtagagca gagcagaagc ctgagccaga cggagagcca cctcctctcc cagggacaga catggctcag cggatgacaa cacagctgct gctccttcta gtgtgggtgg ctgtagtagg ggaggctcag acaaggattg catgggccag gactgagctt ctcaatgtct gcatgaacgc caagcaccac aaggaaaagc caggccccga ggacaagttg catgagcagt gtcgaccctg gaggaagaat gcctgctgtt ctaccaacac cagccaggaa gcccataagg atgtttccta cctatataga ttcaactgga accactgtgg agagatggca cctgcctgca aacggcattt catccaggac acctgcctct acgagtgctc ccccaacttg gggccctgga tccagcaggt ggatcagagc tggcgcaaag agcgggtact gaacgtgccc ctgtgcaaag aggactgtga gcaatggtgg gaagattgtc gcacctccta cacctgcaag agcaactggc acaagggctg gaactggact tcagggttta acaagtgcgc agtgggagct gcctgccaac ctttccattt ctacttcccc acacccactg ttctgtgcaa tgaaatctgg actcactcct acaaggtcag caactacagc cgagggagtg gccgctgcat ccagatgtgg ttcgacccag cccagggcaa ccccaatgag gaggtggcga ggttctatgc  tgcagccatg agtggggctg ggccctgggc agcctggcct ttcctgctta gcctggccct aatgctgctg tggctgctca gctgacctcc ttttaccttc tgatacctgg aaatccctgc cctgttcagc cccacagctc ccaactattt ggttcctgct ccatggtcgg gcctctgaca gccactttga ataaaccaga caccgcac M28099 ggagagccac ctcctctccc aggaactgaa cccaaaggat cacctggtat tccctgagag SEQ ID NO: 263 tacagatttc tccggcgtgg ccctcaaggg acagacatgg ctcagcggat gacaacacag ctgctgctcc ttctagtgtg ggtggctgta gtaggggagg ctcagacaag gattgcatgg gccaggactg agcttctcaa tgtctgcatg aacgccaagc accacaagga aaagccaggc cccgaggaca agttgcatga gcagtgtcga ccctggagga agaatgcctg ctgttctacc aacaccagcc aggaagccca taaggatgtt tcctacctat atagattcaa ctggaaccac tgtggagaga tggcacctgc ctgcaaacgg catttcatcc aggacacctg cctctacgag tgctccccca acttggggcc ctggatccag caggtggatc agagctggcg caaagagcgg gtactgaacg tgcccctgtg caaagaggac tgtgagcaat ggtgggaaga ttgtcgcacc tcctacacct gcaagagcaa ctggcacaag ggctggaact ggacttcagg gtttaacaag tgcgcagtgg gagctgcctg ccaacctttc catttctact tccccacacc cactgttctg tgcaatgaaa tctggactca ctcctacaag gtcagcaact acagccgagg gagtggccgc tgcatccaga tgtggttcga cccagcccag ggcaacccca atgaggaggt ggcgaggttc tatgctgcag ccatgagtgg ggctgggccc tgggcagcct ggcctttcct gcttagcctg gccctaatgc tgctgtggct gctcagctga cctcctttta ccttctgata cctggaaatc cctgccctgt tcagccccac agctcccaac tatttggttc ctgctccatg gtcgggcctc tgacagccac tttgaataaa ccagacaccg c J02876 gaatcaattc ctccaaaccg caagaacagt aacatttatt attcaaaaaa acaaaaacca SEQ ID NO: 264 gattatagga tatgacattt ggtataacaa taatgttatt gaaaaatgga aaaatgatcc attaatggct tgggctaaaa attcggggga cagcctaggg gcctggatct attgcctact tagagagagg ccaactcaga cacagccgtg tatgctccca gcagcaacgg aggttcacgt ccgcctgcag ggacagaaag acatggtctg gaaatggatg ccacttctgc tgcttctggt ctgtgtagcc accatgtgca gtgcccagga caggactgat ctcctcaatg tctgtatgga tgccaagcac cacaagacaa agccaggtcc tgaggacaag ctgcatgacc aatgcagtcc ctggaagaag aatgcctgct gcacagccag caccagccag gagctgcaca aggacacctc ccgcctgtac aactttaact gggaccactg cggcaagatg gagcccgcct gcaagcgcca cttcatccag gacacctgtc tctatgagtg ctcacccaac ctggggccct ggatccagca ggtgaatcag acgtggcgaa aagaacgctt cctggatgtg cccttatgca aagaggactg tcagcgctgg tgggaggatt gtcacacctc ccacacgtgc aagagcaact ggcacagagg atgggactgg acctcaggag ttaacaagtg cccagctggg gctctctgcc gcacctttga gtcctacttc cccactccag ctgccctttg tgaaggcctc tggagtcact catacaaggt cagcaactac agccgaggga gcggccgctg catccagatg tggtttgatt cagcccaggg caaccccaac gaggaagtgg cgaggttcta tgctgcagcc atgcatgtga atgctggtga gatgcttcat gggactgggg gtctcctgct cagtctggcc ctgatgctgc aactctggct ccttggctga gttcagtcct cccagactac ctgccctcag cttggataac caggctgggc tcagctcagc tcccacaaat gacagcccct taagcatgct tctattagtc acctaaccct ctgtcaccca gtctgttgct gctccatggt ggggccaaga gtcacttcta ataaacagac tgttttctaa taaaaaaaaa aaaaaaaaaa 1J08471 cgcaggaata gatggacatg gcctggcaga tgatgcagct gctgcttctg gctttggtga SEQ ID NO: 265 ctgctgcggg gagtgcccag cccaggagtg cgcgggccag gacggacctg ctcaatgtct gcatgaacgc caagcaccac aagacacagc ccagccccga ggacgagctg tatggccagt gcagtccctg gaagaagaat gcctgctgca cggccagcac cagccaggag ctgcacaagg acacctcccg cctgtacaac tttaactggg atcactgtgg taagatggaa cccacctgca agcgccactt tatccaggac agctgtctct atgagtgctc acccaacctg gggccctgga tccggcaggt caaccagagc tggcgcaaag agcgcattct gaacgtgccc ctgtgcaaag aggactgtga gcgctggtgg gaggactgtc gcacctccta cacctgcaaa agcaactggc acaaaggctg gaattggacc tcagggatta atgagtgtcc ggccggggcc ctctgcagca cctttgagtc ctacttcccc actccagccg ccctttgtga aggcctctgg agccactcct tcaaggtcag caactatagt cgagggagcg gccgctgcat ccagatgtgg tttgactcag cccagggcaa ccccaatgag gaggtggcca agttctatgc tgcggccatg aatgctgggg ccccgtctcg tgggattatt gattcctgat ccaagaaggg tcctctgggg ttcttccaac aacctattct aatagacaaa tccacatgaa aaaaaaaaa U02714 gaggagggta tggggaggca cttagttcct gtgtcttccc cacccagtgc agtccctgga SEQ ID NO: 266 agaagaatgc ctgctgcaca gccagcacca gccaggagct gcacaaggac acctcccgcc tgtacaactt taactgggac cactgcggca agatggagcc cgcctgcaag cgccacttca tccaggacac ctgtctctat gagtgctcac ccaacctggg gccctggatc cagcaggtag ggtgtctccc ccccacccac cccagcagac tgccatcccc ctcagtcact tcaaggcgat ggctgccagc atccctggct gagaggagcc ctgcctcccc acctcccacc caggtgaatc agacgtggcg caaagaacgc ttcctggatg tgcccttatg caaagaggac tgtcagcgct ggtgggagga ttgtctcacc tcccacacgt gcaagagcaa ctggcacaga ggatgggact ggacctcagg tgagggtgat tgagttgggg ttaggaaaaa ggagattgag gtagggtttg gaaaatcctc aaggatttgg ggtggggtga agatttctgg gggtggccag aaatgagctt tgggcccagg ggctgaaagt ctgtgtccac catgcctctc cctgcaggag ttaacaagtg cccagctggg gctctctgcc gcacctttga gtcctacttc cccactccag ctgccctttg tgaaggcctc tggagtcact catacaaggt cagcaactac agccgaggga gcggccgctg catccagatg tggtttgatt cagcccaggg caaccccaac gaggaagtgg cgaggttcta tgctgcagcc atgcatgtga atgctggtga gatgcttcat gggactgggg gtctcctgct caggctggcc ctgatgctgc aactctggct ccttggctga gttcagtcct cccagactac ctgccctcag cttggataac caggctgggc tcagctcagc tcccacaaat gccagcccct taagcatgct tctattagtc acctaaccct ctgtcaccca gtctgttgct gctccatggt ggggccaaga gtcacttcta ataaacagac tgttttctaa taa U02716 agcttcaggg ccccagcatc gaaggaacag ggtctgacct catttgccac cgtagggatg SEQ ID NO: 267 gggagactga ggcaggaggt gaatggctcc cagcttggag ccctttcccc tcaggacttg gtttccctac cctacgtccg cctgcaggga cagaaagaca tggtctggaa atggatgcca cttctgctgc ttctggtctg tgtagccacc atgtgcagtg cccaggacag gactgatctc ctcaatgtct gtatggatgc caagcaccac aagacaaagc caggtcctga ggacaagctg catgaccaag tacggctgga gtgtgcctct gctaaggagg ggcttgttct aacagggagg agaaagtcag gatg SEQ ID NO: 269 L L S L A L M L L (E41)

REFERENCES

The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference.

Patents

-   U.S. Pat. No. 3,826,364; issued Jul. 30, 1974. -   U.S. Pat. No. 4,284,412; issued Aug. 18, 1981. -   U.S. Pat. No. 4,498,766; issued Feb. 12, 1985. -   U.S. Pat. No. 4,578,770; issued Mar. 25, 1986. -   U.S. Pat. No. 4,596,792; issued Jun. 24, 1986. -   U.S. Pat. No. 4,599,230; issued Jul. 8, 1986. -   U.S. Pat. No. 4,599,231; issued Jul. 8, 1986. -   U.S. Pat. No. 4,601,903; issued Jul. 22, 1986. -   U.S. Pat. No. 4,608,251; issued Aug. 26, 1986. -   U.S. Pat. No. 4,661,913; issued Apr. 28, 1987. -   U.S. Pat. No. 4,714,682; issued Dec. 22, 1987. -   U.S. Pat. No. 4,767,206; issued Aug. 30, 1988. -   U.S. Pat. No. 4,774,189; issued Sep. 27, 1988. -   U.S. Pat. No. 4,857,451; issued Aug. 15, 1989. -   U.S. Pat. No. 4,989,977; issued Feb. 5, 1991. -   U.S. Pat. No. 5,160,974; issued Nov. 3, 1992. -   U.S. Pat. No. 5,478,722; issued Dec. 26, 1995.

Publications

-   Acres B., Hareuveni M., Balloul J. M. and Kieny M. P. (1993) VV-MUC1     immunisation of mice-immune response and protection against the     growth of murine tumours bcaring the MUC1 antigen J. Immunother.     14:136-143. -   Acres B., Apostolopoulos V., Balloul J. M., Wreschner D. Xing P. X.,     Hadi D. A. et al. (1999) MUC1 specific cytotoxic T cell precursor     analysis in human MUC1 transgenic mice immunised with human MUC1     vaccines. Cancer Immunol. Immunother. 2000 January; 48(10):588-94. -   Almendro et al., “Cloning of the human platelet endothelial cell     adhesion molecule-1 promoter and its tissue-specific expression.     Structural and functional characterization,” J. Immunol.     157(12):5411-5421, 1996. -   Anichini, A. et al., (1993) et al., J. Exp. Med. 177:989-998. -   Apostolopoulos V., Haurum J. S., and McKenzie I. F. C. (1997) MUCI     peptide epitopes associated with 5 different H2 class I molecules.     Eur. J. Immunol. 27:2579-2587. -   Apostolopoulos V., Karanikas V., Haurum J. and     McKenzie I. F. C. (1997) Induction of HLA-A2 restricted cytotoxic T     lymphocytes to the MUCI human breast cancer antigen J. Immunol.     159:56211-5218. -   Apostolopoulos V., Chelvanayagam G., Xing P.-X and     McKenzie I. F. C. (1998) Anti-MUCI antibodies react directly with     MUCI peptides presented by class I 142 and HLA molecules J. Immunol.     161:767-775. -   Apostolopoulus V. Xing P.-X. and McKenzic I. F. C. (1994) Murine     immuno response to cells transfected with human MUC1: Immunisation     with cellular and synthetic antigens. Cancer Res. 54: 5186-5193. -   Apostolopoulos V., Pietersz G. A., Loveland B. E., Sandrin M. S, and     McKenzie I. F. C. (1995) Oxidative/reductive conjugation of mannan     to antigen selects for T1 or T2 immune responses. Proc. Natl. Acad.     Sci. USA 92: 10128-10132. -   Apostolopoulos V., Popovski V. and McKenzie I. F. C. (1998)     Cyclophosphamide enhances the CTL precursor frequency in mice     immunized with MUC1-mannan fusion protein (M-FP). J. Immunother.     21:109-113. -   Astori M. and Krachenbuhl J. P. (1996) Recombinant fusion peptices     containing single or multiple repeats of a ubiquitous T-helper     epitope are highly immunogenic. Mol. Immunol. 33: 1017-1024. -   Barth, R. J., et al., (1991) J. Exp. Med. 173:647-658. -   Bartnes K., Hannestad K., Guichard G. and Briand J.P. (1997) A     retro-inverso analog mimics he cognate peptide epitope of a CD4+ T     cell clone. Eur. J. Immunol. 27:1387-1391. -   Beekman N. J., Schaaper W. M., Tesser G. I., Dalsgaard K., Kamstrup     S., Langeveld J. P. et al. (1997) Synthetic peptide vaccines:     palmitoylation of peptide antigens by a thioester bond increases     immunogenicity. J. Pept. Res. 50: 357-364. -   BenMohamed L., Gras-Masse H., Tarter A., Daubersies P., Bahimi K.,     Bossus M. et al. (1997) Lipopeptide immunization without adjuvant     induces potent and long-lasting B. T. helper, and cytotoxic T     lymphocyte responses against a malaria liver stage antigen in mice     and chimpanzees, Eur. J. immunol. 27: 1242-1253. -   Blaese, R. M., Pediatr. Res., 33 (1 Suppl):S49-S53 (1993). -   Briand J. P., Benkirane N., Guichard G., Newman J. F. E., Van     Regenmortelo M. H., Brown F. et al. (1997) A retro-inverso peptide     corresponding to the GH loop of foot-and-mouth disease virus elicits     high levels of long-lasting protective neutralizing antibodies.     Proc. Natl. Acad. Sci. USA 94: 12545-12550. -   Chakraborty N. G., Sporn J. R., Tortora A. F., Kurtzman S. H.,     Yamase H., Ergin M. T. et al. (1998) Immunization with a     tumor-cell-lysate-loaded autologous-antigen-presenting-cell-based     vaccine in melanoma. Cancer Immunol. Immunother, 47: 58-64. -   Chen T. T., Tao M. H. and Levy R. (1994) Idiotype-cytokine fusion     proteins as cancer vaccines. Relative efficacy of IL-2, IL-4 and     granulocyte-macrophage colony-stimulating factor. J. Immunol.     153:4775-4787. -   Ciupitu A. M. Petersson M., O'Donnell C. L., Williams K., Jindal S.,     Kiessling R. et al. (1998) Immunization with a lymphocytic     choriomeningitis virus peptide mixed with heat shock protein 70     results in protective antiviral immunity and specific cytotoxic T     lymphocytes. J. Exp. Med. 187:685-691. -   Creswell P. (1994) Assembly, transport and function of MHC class I     molecules. Ann. Rev. Immunol. 12:259-293. -   Culver, L., et al. Proc. Natl. Acad. Sci. USA, 88:3155-3159 (1991). -   Dalgleish, A. G. Cancer vaccines. Br. J. Cancer 82(10): 1619-1624. -   Darrow, T. L., et al., (1989) J. Immunol. 142:3329-3335. -   DeLeo A. B. (1998) p53-based immunotherapy of cancer. Crit. Rev.     Immunol. 18: 29-35. -   Deprez B., Sauzet J. P., Boutillon C., Martinon F., Tartar A.,     Sergheraert C. et al. (1996) Comparative efficiency of simple     lipopeptide constructs for in vivo induction of virus-specific CTL.     Vaccine 14: 375-382. -   Derossi D., Joliot G., Chassaing G. and Prochiantz A. (1994) The     third helix of the Antennapedia homeodomain translocates through     biological membranes. J. Biol. Chem. 269: 10444-10450. -   Derossi D., Calvet S., Trembleau A., Brunissen A., Chassaing G. and     Prochiantz A. (1996) Cell internalization of the helix of the     Antennapedia homeodomain is receptor-independent. J. Biol. Chem.     271: 18188-18193. -   Ding L., Lalani E. N. and Reddish M. (1993) Immunogenicity of     synthetic peptides related to the core peptide sequence encoded by     the human MUC1 gene: effect of immunisation on the growth of murine     mammary adenocarcinoma cells transfected with the human MUC1 gene.     Cancer Immunol. Immunother. 36:9-17. -   Disis M. L., Bernhard H., Shiota F. M., Hand S. L., Gralow J. R.,     Huseby E. S. et al. (1996) Granulocyte macrophage colony-stimulating     factor: an effective adjuvant for protein and peptide-based vaccines     Blood 88:-202-210 -   Donnelly J. J., Ulmer J. B., Hawe L. A., Friedman A., Shi X. P.,     Leander K. R. et al. (1993) Targeted delivery of peptide epitopes to     class I major histocompatibility molecules by a modified Pseudomanas     exotoxin. Proc. Natl. Acad. Sci. USA 90: 3530-3534. -   Elwood, P. C. Molecular cloning an dcharacterization of the human     folate binding protein cDNA from placenta and malignant tissue     culture (KB) cells. J. Biol. Chem. 264: 14893-14901, 1989. -   Fayolle C., Sebo P., Ladant D., Ullmann A. and Leclerc C. (1996) In     vivo induction of CTL responses by recombinant adenylate cyclase of     Bordetella pertussis carrying viral CD8+ T cell epitopes. J.     Immunol. 156:4697-4706. -   Fukasawa M., Shimizu Y., Shikata K., Nakata M., Sakak-ibara R.,     Yamamoto N. et al. (1998) Liposome oligomannase-coated with     neoglycolipid, a new candidate for a safe adjuvant for induction of     CD8+ cytotoxic T lymphocytes. FEBS Lett. 441: 353-356. -   Garin-Chesa, P., Campbell, I. Suigo, P. E., Lewis, J. L., Old, L.     J., and Rettig, W. J. Trophoblast and ovarian cancer antigen LK26.     Sensitivity and specificity in immunopathology and molecular     identification as a folate binding protein. Am. J. Pathol., 142:     557-567, 1993. -   Gendler S. J., Papadimitriou J. T., Duhig T., Rothbard J. and     Burchell J. (1998) A highly immunogenic region of human polymorphic     epithelial mucin expressed by carcinomas is made up of tandem     repeats, J. Biol. Chem. 263:12820-12823. -   Goletz T. J., Klimpel K. R., Arora N., Leppla S. H., Keith J. M. and     Berzofsky J. A. (1997) Targeting HIV proteins to the major     histocompatibility complex class I processing pathway with a novel     gp120-antrax toxin fusion protein, Proc. Natl. Acad. Sci. USA 94:     12059-12064. -   Gong J., Chen D., Kashiwaba M. and Kufe D. (1997) Induction of     antitumour activity by immunization with fusions of denddritic and     carcinoma cells. Nature Med. 3: 558-561. -   Gong J., Chen D., Kashiwaba M., Li Y., Chen L., Takeuchi H. et     al. (1998) Reversal of tolerance to human MUC1 antigen in MUC1     transgenic mice immunized with fusions of dendritic and carcinoma     cells. Proc. Natl. Acad. Sci. USA 95: 6279-6283. -   Goydos J. S., Elder E., Whiteside T. L., Finn O. J. and     Lotze M. T. (1996) A phase I trial of a synthetic mucin peptide     vaccine. Induction of specific immune reactivity in patients with     adenocarcinoma. J. Surg. Res. 63: 298-304. -   Gras-Masse H., Boutillon C., Diesis E., Deprez B. and     Tartar A. (1997) Confronting the degeneracy of convegent     combinatorial immunogens or ‘mixotopes’, with the specificity of     recognition of the target sequences. Vaccine 15:1568-1578. -   Guan H. H., Budzynski W., Koganty R. R., Kantz M. J., Reddish M. A.,     Rogers J. A. et al (1998) Liposomal formulations of synthetic MUC1     peptides: effects of encapsulation versus surface display of     peptides on immune responses. Bioconjug. Chem. 9:451-458. -   Guichard G., Connan F., Graff R., Ostankovitch M., Muller S.,     Guillet J. G. et al. (1996) A partially modified retro-inverso     pseudopeptide as a non-natural ligand for the human class I     histocompatibility molecule HLA-A2. J. Med. Chem. 39: 2030-3039. -   Hurpin C, Rotarioa C, Bisceglia H, Chevalier M, Tartaglia J,     Erdile L. The mode of presentation and route of administration are     critical for the induction of immune responses to p53 and antitumor     immunity. Vaccine. 1998 January-February; 16(2-3):208-15. -   Heeg K., Kuon W. and Wagner H. (1991) Vaccination of class I major     histocompatibility complex (MHC)-restricted murine CD8+ cytotoxic T     lymphocytes towards soluble antigens: immunostimulating-ovalbumin     complexes enter the class I MHC-restricted antigen pathway and allow     sensitization against the immunodominant peptide. Eur. J. Immunol.     21: 1523-1527. -   Heike M., Noll B. and Meyer zum Buschenfelde K. H. (1996) Heat shock     protein-peptide completes for use in vaccines. J. Leukoc. Biol. 60:     153-158. -   Henderson R. A., Konitsky W. M., Barratt-Boyes S. M., Soares M.,     Robbins P. D. and Finn O. J. (1998) Retroviral expression of MUC-1     human tumor antigen with intact repeat structure and capacity to     elicit immunity in vivo. J. Immunother. 21:247-256. -   Henderson R. A., Nimgaonkar M. T., Watkins S. C., Robbins P. D.,     Ball E. D. and Finn O. J. (1996) Human dendritic cells genetically     engineered to express high levels of the human epithelial tumor     antigen mucin (MUC-1). Cancer Res. 56:3763-3770. -   Herve M., Maillere B., Mourier G., Texier C., Leroy S, and     Menez A. (1997) On the immunogenic properties of retro-inverso     peptides. Total retro-inversion of T-cell epitopes causes a loss of     binding to MHC II molecules. Mol. Immunol. 34:157-163. -   Hom, S. S., et al., (1991) J. Immunother. 10:153-164. -   Hom, S. S., et al., (1993) J. Immunother. 13:18-30. -   Hsu S. C., Schadeck E. B., Delmas A., Shaw M. and Stewart M.     W., (1996) Linkage of a fusion peptide to a CTL epitope from the     nucleoprotein of measles virus enables incorporation into ISCOMs and     induction of CTL responses following intranasel immunization.     Vaccine 14:1159-1166. -   Hwu, P., et al. J. Immunol, 150:4104-415 (1993).

Itoh, K. et al., (1986), Cancer Res. 46:3011-3017.

-   Jerome K. R., Domenech N. and Finn O. J. (1993) Rumor-specific CTL     clones from patients with breast and pancreatic adenocarcinoma     recognize EBV-immortalized B cells transfected with polymorphic     epithelial mucin cDNA. J. Immunol. 151: 1654-1662. -   Karanikas V., Hwang L., Pearson J., Ong C. S., Apostolopoulos V.,     Vaughan H. et al. (1997) Antibody and T cell responses of patients     with adenocarcinoma immunized with mannan-MUC1 fusion protein. J.     Clinical Invest. 100: 2783-2792. -   Kawakami, Y., et al., (1992) J. Immunol. 148:638-643. -   Kawakami, Y., et al., (1993) J. Immunother. 14:88-93. -   Kawakami Y., Robbins P. F., Wanx X., Tupesis J. P., Parkhurst M. R.,     Kang X. et al. (1998) Identification of New melanoma epitopes on     melanosomal proteins recognized by tumor infiltrating T lymphocytes     restricted by HLA-A1, -A2, and -A3 alleles J. Immunology     161:6985-6992. -   Kim, D., Lee, T. V., Castilleja, A., Anderson, B. W., Papler, G. E.     Kudella, A. P., Murray, J. L., Sittisomwong, T., Wharton, J. T.,     Kim, J. Ioannides, C. G. Folate binding protein peptide 191-199     presented on dendritic cells can simulate CTL from ovarian and     breast cancer patients. Anticancer Res., 18:2907-2916, 1999. -   Kim D. T., Mitchell D. J., Brockstedt D. G., Fong L., Nolan G. P.,     Fathman C. G. et al. (1997) Introduction of soluble proteins into     the MHC class I pathway by conjugation to an HIV tat peptide. J.     Immunol: 159: 1666-1668. -   Kraus et al., “Alternative promoter usage and tissue specific     expression of the mouse somatostatin receptor 2 gene,” FEBS Lett.,     428(3):165-170, 1998. -   Lareyre et al., “A 5-kilobase pair promoter fragment of the murine     epididymal retinoic acid-binding protein gene drives the     tissue-specific, cell-specific, and androgen-regulated expression of     a foreign gene in the epididymis of transgenic mice,” J Biol. Chem.,     274(12):8282-8290, 1999. -   Lee et al., “Activation of beta3-adrenoceptors by exogenous dopamine     to lower glucose uptake into rat adipocytes,” J Auton Nerv Syst.     74(2-3):86-90, 1997. -   Lee, T. V., Anderson, B. W., Peoples, G. E., Castilleja, A.,     Murray, J. L., Gershenson, D. M., and Ioannides, C. G.     Identification of activated tumor-Ag-reactive CD8+ cells in healthy     individuals, Oncology Reports, 7:455-466, 2000. -   Lee R. S., Tartour E., van der Bruggen P., Vantomme V., Joyeaux I.,     Goud B. et al., (1998) Major histocompatibility complex class I     presentation of exogenous soluble tumour antigen fused to the     B-fragment of Shiga toxin. Eur. J. Immunol. 28:2726-2737. -   Lees C. J. Apostolopoulos V., Acres B. A., Ong C.-S., and T2     cyokines on the cytotoxic T cell response to mannan-MUCI. Cancer     Immuno. Immother. 2000 February; 48(11):644-52. -   Li, P. Y., Del Vecchio, S., Fonti, R., Carrieto, M. V., Potena, M.     I., Botti, G., Miotti, S., Lastoria, S., Menard, S., Colnaghi, M. I.     and Salvatore, M. Local characterization of folate binding protein     GP38 in sections of human ovarian carcinoma by in vitro quantitative     autoradiography. J. Nucl. Med. 37:665-672, 1996. -   Lofthouse S. A., Apostolopoulos V., Piertersz G. A. and     McKenzie I. F. C. (1997) Induction of T1 (CTL) and/or T2 (antibody)     response to a mucin 1 tumor antigen, Vaccine 25: 1586-1593. -   Lustgarten J., Theobald M., Labadic C., LaFacc D., Peterson P.,     Disis M. L. et al. (1997) Identification of Her-2/NeuCTL epitopes     using double transgenic mice expressing HLA-A2.1 and human CD*. Hum.     Immunol. 52:109-118. -   Malcherek G., Wirblich C., Willcox N., Rammensee H. G., Trowsdale J.     and Melms A. (1998) MHC class II-associated invariant chain peptice     replacement by T cell epitopes: engineered invariant chain as a     vehicle for directed and enhanced MHC class II antigen processing     and presentation. Eur. J. Immunol. 28:1524-1533. -   Matco, L., Gardner J., Chen Q., Schmidt C., Down M., Elliott S. L.     et al. (1999) An HLA-A2 polyepitope vaccine for melanoma     immunotherapy. J. Immunol. 163:4058-4063. -   McCarty T. M., Liu X., Sun J. Y., Peralta E. A., Diamond D. J. and     Ellenhom J. D. (1998) Targeting p53 for adoptive T-cell     immunotherapy. Cancer Res. 58: 2601-2605. -   Minev B. R., McFarland B. J., Spiess P. J., Rosenberg S. A. and     Restifo N. P. (1994) Insertion signal sequence fused to minimal     peptides elicits specific CD8+ T-cell responses and prolongs     survival of thymoma-bearing mice. Cancer Res. 54:4155-4161. -   Muul, L. M., et al. (1987), J. Immunol. 138:989-995. -   Nakanishi T., Kunisawa J., Hayashi A., Tsutsumi Y., Kubo K.,     Nakagawa S. et al. (1997) Positively charged liposome functions as     an efficient immunoadjuvant in inducing immune responses to soluble     proteins. Biochem. Biophys. Res. Commun. 240:793-797. -   Nakao M., Hazama M., Mayumi-Aono A., Hinuma S, and     Fujisawa Y. (1994) Immunotherapy of acute and recurrent herpes     simplex virus type 2 infection with an adjuvant-free form of     recombinant glycoprotein D-interleukin-2 fusion protein. J. Infect     Dis. 169:787-791. -   Nestle F. O., Alijagic S., Gilliet M., Sun V., Grabbe S., Dumer R.     et. al, (1998) Vaccination of melanoma patients with peptide- or     tumor lysate-pursued dendritic cells, Nature Med. 4:328-332. -   Noguchi Y., Noguchi T., Sata T., Yokoo Y., Itoh S., Yoshida M. et     al. (1991) Priming for in vitro and in vivo anti-human T     lymphotropic virus type 1 cellular immunity by virus-related protein     reconstituted into liposome. J. Immunol. 146: 3599-3603. -   Nomoto et al., “Cloning and characterization of the alternative     promoter regions of the human LIMK2 gene responsible for alternative     transcripts with tissue-specific expression,” Gene, 236(2):259-271,     1999. -   Obert M., Plkeuger H., Hanagarth II. G., Schulte-Monting J.,     Wiesmuller K. H., Braun D. G., et al. (1998) Protection of mice     against SV40 tumors by Pam3Cys, MTP-PE and Pam3Cys conjugated with     the SV40 T antigen-derived peptide K(698)-T(708). Vaccine 16:     161-169. -   O'Neil, B. H., et al., (1993) J. Immunol. 151:1410-1418. -   Pardoll, D. M. (2000) Clin. Immunol. 95 (1): S44-S62. -   Parkhurst M. R., Fitzgerald E. B., Southwood S., Sette A.,     Rosenberg S. A. and Kawakami Y. (1998) Identification of a shared     HLA-A*020-restricted T-cell epitope from the melanoma antigen     tyrosinase related protein 2 (TRP2). Cancer Res. 58:4895-4901. -   Partidos C. D., Vohra P. and Stewart M. W. (1996) Priming of measles     virus-specific CTL responses after immunization with a CTL epitope     linked to a fusogenic peptide. Virology 215: 107-110. -   Peoples, G. E., Anderson, B. W., Fisk, B., Kudelka, A. P.,     Wharton, J. T., and Ioannides, C. G. Ovarian cancer-associated     lymphocytes recognize folate binding protein (FBP) peptides. Ann.     Surg Oncol., 5(8):743-750, 1998. -   Peoples, G. E., Anderson, B. W., Murray, J. L., Kudelka, A. P.,     Eberlein, T. J., Wharton, J. T., and Ioannides, C. G. Vaccine     implications of folate binding protein in epithelial cancers. Clin.     Cancer Res., 5:4214-4223, 1999. -   Pietersz, G. A. et al. (2000) Generation of cellular immune     responses to antigenic tumor peptides. Cell. Mol. Life. Sci.     57:290-310. -   Pietersz G. A., Wenjun L., Popovski V., Caruana J. A.     Apostolopoulos V. and McKenzie I. F. C. (1998) Parameters in using     mannan-fusion protein (M-FP) to induce cellular immunity. Cancer     Immunol. Immunother. 45: 321-326. -   Rammensee H. G. (1995) Chemistry of peptides associated with MHC     class I and class 1 molecules. Curr. Opin. Immunol. 7:85-96. -   Rammensee H. G., Friede T. and Stevanovic S. (1995) MHC ligands and     peptide motiffs: first listing. Immunogenetics 41:178-228. -   Reddish M., MacLean G. D., Koganty R. R., Kan-Mitchell J., Jones V.,     Mitchell M. S. et al. (1998) Anti-MUC1 class I restricted CTLs in     metastatic breast cancer patients immunized with a synthetic MUC1     peptide. Int. J. Cancer 76: 817-823. -   Remington's Pharmaceutical Sciences, 18th Ed. Mack Printing Company,     1990. -   Retrig, W. J., Cordon-Cardo, C., Koulos, J. P., Lewis, J. L.,     Oertgen, H. F., and Old, L. J. Cell surface antigens of human     trophoblast and choriocarcinoma defined by monoclonal antibodies.     Int. J. Cancer 35: 469-475, 1985. -   Reynolds S. R., Celis E., Sette A., Oratz R., Shapiro R. L.,     Johnston D. et al, (1998) HLA-independent heterogeneity of CDS+ T     cell responses to MAGE-3, Melan-A/MART-1, gp 100, tyronsinase, MCIR     and TRP-2 in vaccine-treated melanoma patients, J. Immunol.     161:6970-6976. -   Rimmelzwaan G. F., Baars M., van Beek R., van Amerongen G.,     Lovgren-Bengtsson K., Claas E. C. et al. (1997) Induction of     protective immunity against influenza virus in a macaque model:     comparison of conventional and iscom vaccines. J. Gen. Virol.     78:757-765. -   Rivoltini L., Squarcina P., Loftus D. J., Castelli C., Tarsini P.,     Mazzocchi A. et al. -   (1999) A superagonist variant of peptide—MART1/Melan A27-35 elicits     anti-melanoma CD8+ T cells with enhanced functional characteristics:     implication for more effective immunotherapy. Cancer Res.     59:301-306. -   Rosenberg, S. A., et al., (1986) Science 3233:1318-1321. -   Rosenberg, S. A., et al., (1988) N Engl J Med 319:1676-1680. -   Rosenberg S. A. (1992) J. Clin. Oncol. 10:180-199. -   Rosenberg, S. A. (2000) Cancer J. 6, Supp. 2: S142-S149. -   Rosenberg S. A., Yang J. C., Schwartzentruber D. J., Hwu P.,     Marincola F. M., Topalian S. L. et al. (1998) Immunologic and     therapeutic evaluation of a synthetic peptide vaccine for the     treatment of patients with metastatic melanoma, Nature Med. 4:     321-327. -   Rowell J. F., Ruff A. L., Guarnieri G. G., Stavely-O'Carroll K., Lin     X., Tang J. et al. (1995) Lysosome-associated membrane     protein-1-mediated targeting of the HIV-1 envelope protein to an     endosomal/lysosomal compartment enhances its presentation to MHC     class II-restricted T cells. J. Immunol. 155: 1818-1828. -   Rowse G. J., Tempero R. M., VanLith M. L., Hillingsworth M. A. and     Gendler S. J. (1998) Tolerance and immunity to MUC1 in a human MUC1     transgenic murine model. Cancer Res. 58: 315-321. -   Samuel J., Budynski W. A., Reddish M. A., Ding L., Zimmermann G. I.,     Krantz M. I. et al. (1998) Immunogenicity and antitumour activity of     a liposomal MUCI peptide-based vaccine. Int. J. Cancer 75: 295-302. -   Schutze-Redelmeier M. P., Gournier H., Garcia-Pons F., Moussa M.,     Joliot A. H., Volovitch M. et al. (1996) Introduction of exogenous     antigens into the MHC class I processing and presentation pathway by     Drosophila antennapedia homeodomain primes cytotoxic T. cells in     vivo. J. Immunol. 157:650-655. -   Sensi, M., et al., (1993) J. Exp. Med. 178:1231-1246. -   Sjolander A., van't Land B. and Lovgren Bengtsson K., (1997) Iscoms     containing purified Quillaja saponins upregulate both Th1-like and     Th2-like immune responses. Cell Immunol. 10:69-76. -   Speir J. A., Abdel-Motal U. M., Jondal M. and Wilson I. A. (1999)     Crystal structure of an MHC class I presented glycopeptide that     generates carbohydrates-specific CTL. Immunity 10:51-61. -   Stenmark H., Moskaug J. O., Madshus I. H., Sandvig K. and     Olsnes S. (1991) Peptices fused on the amino-terminal end of     diphtheria toxin are translocated to the cytosol. J. Cell Biol. 113:     1025-1032. -   Suzue K., Zhou X., Eisen H. N. and Young R. A. (1997) Heat shock     fusion proteins as vehicles for antigen delivery into the major     histocompatibility complex class I presentation pathway. Proc. NaI.     Acad. Sci. USA 94: 13146-13151. -   Tao M. H. and Levy R. (1993) Idiotype/granulocyte-macrophage     colony-stimulating factor fusion protein as a vaccine: for B-cell     lymphoma. Nature 362:755-758. -   Tarpey I., Stacey S. N., McIndoe A. and Davies D. H. (1996) Priming     in vivo and quantification in vitro of class I MHC-restricted     cytotoxic T cells to human papilloma virus type 11 early proteins     (E6 and E7) using immunostimulating complexes (ISCOMs). Vaccine 14:     230-236. -   Theobald M., Biggs J., Dittmer D., Levine A. J. and     Sherman L. A. (1995) Targeting p53 as a general tumor antigen. Proc.     Natl. Acad. Sci. USA 92: 11993-11997. -   Topalian, S. L., et al., (1989) J. Immunol. 142:3714-3725. -   Tsumaki et al., “Modular arrangement of cartilage- and neural     tissue-specific cis-elements in the mouse alpha2(XI) collagen     promoter,” J Biol. Chem. 273(36):22861-22864, 1998. -   Udono H. and Srivastava P. K. (1993) Heat shock protein 70     associated peptides elicit specific cancer immunity. J. Exp. Med.     178: 1391-1396. -   Van Der Burg S. H., Vissern M. J., Brandt R. M., Kast W. M. and     Melief C. J. (1996) Immunogenicity of peptices bound to MHC class 1     molecules depends on the MHC peptide complex stability. J. Immunol.     156:3308-3314. -   Villacres-Eriksson M. (1995) Antigen presentation by naïve     macrophages, dendritic cells and B cells primed T lymphocytes and     their cytokine production following exposure to immunostimulating     complexes. Clin. Exp. Immunol. 102:46-52. -   Vogel F. R. and Powell M. F. (1995) A compendium of vaccine     adjuvants and excipients. In: Vaccine Deign: The Subunit and     Adjuvant Approach. Pharmaceutical Biotechnology, vol. 6, pp.     141-228, Powell M. F. and Newman M. J. (eds), Plenum Press, New     York. -   Weitman, S. D., Lark, R. H., Coney, L. R., Fort, D. W., Frasca, V.,     Zurawski, V. R., and Kamen, B. A. Distribution of the folate     receptor GP38 in normal and malignant cell lines and tissues. Cancer     Res. 52: 3396-3401, 1992. -   Wu et al., “Promoter-dependent tissue-specific expressive nature of     imprinting gene, insulin-like growth factor II, in human tissues,”     Biochem Biophys Res Commun. 233(1):221-226, 1997. -   Wu T. C., Guarnieri F. G., Staveley-O'Carroll K. F., Viscidi R. P.,     Levitsky H. I., Hedrick I., et al. (1995) Engineering an     intracellular pathway for major histocompatibility complex class II     presentation of antigens. Proc. Natl. Acad. Sci. USA 92:11671-11675. -   Xing P.-X., Tjandra J. J., Stacker S. A., T. J. G., Thompson C. H.,     McLaughlin P. J. et al, (1989) Monoclonal antibodies reactive with     mucin expressed in breast cancer. Immunol. Cell. Biol. 67: 183-195. -   Xing P.-X., Apostolopoulos V., Michaels M., Prenzoska J., Bishop J.     and McKenzie I. F. C. (1995) Phase I study of synthetic MUC1     peptides in cancer. Int:J. OncoL 6:1283-1289. -   Xing P.-X, Reynolds K., Tjandra J. J., Tang X. L. and     McKenzie I. F. C. (1990) Synthetic peptides reactive with anti-human     milk fat globule membrane monoclonal antibodies. Cancer Res.     50:89-96. -   Zeng Z. H., Castano A. R., Segelke B. W., Stura E. A. Peterson P. A.     and Wilson I. A. (1997) Crystal structure of mouse CD1: an MHC-like     fold with a large hydrophobic binding groove. Science 277: 339-345. -   Zhang S., Graeber L. A., Helling F., Ragupathi G., Adluri S.,     Lloyd K. O. et al. (1996) Augmenting the immunogenicity of synthetic     MUC1 peptide vaccines in mice. Cancer Res. 56: 3315-3319. -   Zhao-Emonet et al., “The equine herpes virus 4 thymidine kinase is a     better suicide gene than the human herpes virus 1 thymidine kinase,”     Gene Ther. 6(9):1638-1642, 1999. -   Zhu X., Zhao X., Burkholder W. F., Gragerov A., Ogata C. M.,     Gottesman M. E. et al. (1996) Structural analysis of substrate     binding by the molecular chaperone DnaK. Science 272: 1606-1614.

All of the compositions and methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as described herein. 

1. A composition comprising a peptide comprising the amino acid sequence of a variant of a peptide fragment of the polypeptide of SEQ ID NO:10, wherein said peptide is of at least 9 amino acids or up to about 30 amino acids, comprises SEQ ID NO:268 but for substitution therein of the amino acids at positions 5 and 7 thereof by phenylalanine and threonine, respectively, and further comprises from 0 to 21 additional contiguous amino acids of the amino acid sequences of SEQ ID NO:10 that flank SEQ ID NO:268 as contained therein.
 2. The composition of claim 1, further comprising a peptide fragment of the polypeptide of SEQ ID NO:10, wherein said peptide fragment is of at least 9 amino acids or up to about 30 amino acids, and comprises SEQ ID NO:268.
 3. The composition of claim 1 in a pharmaceutically acceptable excipient.
 4. The composition of claim 1, further comprising a second peptide comprising the amino acid sequence of a variant of a peptide fragment of the polypeptide of SEQ ID NO:10, wherein said second peptide is of at least 9 amino acids or up to about 30 amino acids and comprises an amino acid sequence selected from the group consisting of: a) the amino acid sequence of SEQ ID NO:268 but for substitution therein of he amino acid at position 5 thereof by phenylalanine; b) the amino acid sequence of SEQ ID NO:268 but for substitution therein of the amino acids at positions 1, 5, 6, and 7 thereof by phenylalanine, phenylalanine, alanine, and threonine, respectively; c) the amino acid sequence of SEQ ID NO:268 but for substitution therein of the amino acids at positions 6 and 7 thereof by alanine and threonine, respectively; d) the amino acid sequence of SEQ ID NO:268 but for substitution therein of the amino acids at positions 1 and 7 thereof by phenylalanine and threonine, respectively; e) the amino acid sequence of SEQ ID NO:268 but for substitution therein of the amino acids at positions 1, 5, and 7 thereof by phenylalanine, phenylalanine, and threonine, respectively; and f) the amino acid sequence of SEQ ID NO:268 but for substitution therein of the amino acids at positions 1 and 7 thereof by glycine and threonine, respectively.
 5. A composition comprising a peptide consisting of the amino acid sequence of SEQ ID NO:5.
 6. The composition of claim 5, further comprising a second peptide consisting of the amino acid sequence of SEQ ID NO:8.
 7. The composition of claim 5 in a pharmaceutically acceptable excipient.
 8. The composition of claim 6 in a pharmaceutically acceptable excipient.
 9. The composition of claim 1, wherein the peptide is capable of inducing an immune response against a folate binding protein having the amino acid sequence of SEQ ID NO:10 or a cell comprising said folate binding protein.
 10. The composition of claim 1, wherein the peptide is capable of a stimulating a cytotoxic T lymphocyte.
 11. The composition of claim 10, wherein the cytotoxic T lymphocyte is also capable of recognizing a folate binding protein epitope consisting of the amino acid sequence of SEQ ID NO:268. 